
Calhoun 

Iniiiiuiiortfl Arthivcof (he Navjl Pwigndualt School 


Calhoun: The NPS Institutional Archive 
DSpace Repository 



Theses and Dissertations 


1. Thesis and Dissertation Collection, all items 


1994-06 

Double Eulerian cycles on de Bruijn digraphs 

Krahn, Gary William 

Monterey, California. Naval Postgraduate School 


http://hdl.handle.net/10945/42901 


This publication is a work of the U.S. Government as defined in Title 17, United 
States Code, Section 101. Copyright protection is not available for this work in the 
United States. 

Downloaded from NPS Archive: Calhoun 



DUDLEY 

KNOX 

LIBRARY 


htt p ://w w w. n ps.e-du/l ib ra ry 


Calhoun is the Naval Postgraduate School's public access digitaI repository for 
research materials and institutional publications created by the NPS community. 
Calhoun is named for Professor of Mathematics Guy K. Calhoun, NPS's first 
appointed —and published —scholarly author. 

Dudley Knox Library / Naval Postgraduate School 
411 Dyer Road / 1 University Circle 
Monterey, California USA 93943 






AD-A283 334 

NMML POSTGRADUATE SCHOOL 
Monterey, CatMa 



DISSERTATION 


Double Eulerian Cycles on de Bruijn Digraphs 


by 


Gary William Krahn 


June 1994 


Dissertation Supervisor: 


Harold Fredricksen 


Approved for public release; distribution unlimited 




OTIC 


QUALITY OTBPEC2BD, 


94 8 12 "" 051 










REPORT DOCUMENTATION PAGE 


Form Approved OMB Np. 0704 


Public reponinj burden for Ihii collection of information it estimated to average 1 hour per response, including the time for reviewing instruction, 
searching existing data sources, gathering and maintaining the data needed, and completing and reviewing the collection of information. Send comments 
regarding this burden estimate or any other aspect of this collection of information, including suggestions for reducing this burden, to Washingon 
headquarters Services, Directorate for Information Operations and Reports, 121) Jefferson Davis Highway, Suite 1204, Arlington, VA 22202-4302, and 
to the Office of Management and Budget. Paperwork Reduction Project 10704-0188) Washington DC 20)03. 


1. AGENCY USE ONLY (Leave blank) 2. REPORT DATE 

June 1994 


3. REPORT TYPE AND DATES COVERED 
Ph.D. Dissertation 


5. FUNDING NUMBERS 


12b. DISTRIBUTION CODE 


4. TITLE AND SUBTITLE 

DOUBLE EULERIAN CYCLES ON DE BRUIJN DIGRAPHS 


6. AUTHOR(S) Gary William Krahn 


7. PERFORMING ORGANIZATION NAME(S) AND ADDRESSES) 
Naval Postgraduate School 
Monterey CA 93943-5000 


9. SPONSORING/MONITORING AGENCY NAME(S) AND ADDRESS(ES) 


8. PERFORMING 
ORGANIZATION 
REPORT NUMBER 


10 . 

SPONSORING/MONITORING 
AGENCY REPORT NUMBER 


li. SUPPLEMENTARY NOTES The views expressed in this thesis are those of the author and do not 
reflect the official policy or position of the Department of Defense or the U.S. Government. 


12a. DISTRIBUTION/AVAILABILITY STATEMENT 

Approved for public release; distribution is unlimited. 


13.ABSTRACT (maximum 200 words) 

A binary de Bruijn sequence has the property that every st-tuple is distinct on a given period of length 2". An efficient 
algorithm to generate a class of classical de Bruijn sequences is given based upon the distance between cycles within the 
Good - de Bruijn digraph. The de Bruijn property on binary sequences is shown to be a randomness property of the 
ZERO and ONE run sequences. Utilizing this randomness we find additional new structure in de Bruijn sequences. We 
analyze binary sequences that are not de Bruijn but instead possess the sufficient structure so that every distinct binary 
n-tuple can be systematically "combed” out of the sequence. These complete or nonclassical de Bruijn sequences are a 
generalization of the well-known de Bruijn cycle. 

Our investigation focuses on binary sequences, called double Eulerian cycles, that define a cycle along a graph (digraph) 
visiting each edge (arc) exactly twice. A new algorithm to generate a class of double Eulerian cycles on graphs and 
digraphs is found. Double Eulerian cycles along the binary Good - de Bruijn digraph are partitioned by the run structure 
of their defining sequences. This partition allows for a statistical analysis to determine the relative size of the set of 
complete cycles defined by the sequences we study. A measure that categorizes double Eulerian cycles along graphs 
(digraphs) by the distance between the two visitations of each edge (arc) is provided. An algorithm to generate double 
Eulerian cycles of minimum measure is given. 


14. subiect TERMS de Bruijn Cycles, Eulerian Cycles, Shift Register, Binary 
Sequences, Digraphs, Graphs, Random Sequences, Good - de Bruijn Digraph 


SECURITY CLASSIFI¬ 
CATION OF REPORT 
Unclassified 


SECURITY CLASSIFI¬ 
CATION OF THIS PAGE 
Unclassified 


SECURITY CLASSIFI¬ 
CATION OF ABSTRACT 
Unclassified 


NUMBER OF 
PAGES 146 


PRICE CODE 


LIMITATION OF 
ABSTRACT 

































Approved for public release; distribution is unlimited. 


Double Eulerian Cycles on de Bruijn 

Digraphs 

b y 

Gary William Krahn 

Lieutenant Colonel, United States Army 
B.S., United States Military Academy, 1977 
M.S., Naval Postgraduate School, 1985 

Submitted in partial fulfillment of the 
requirements for the degree of 

DOCTOR OF PHILOSOPHY IN APPLIED MATHEMATICS 

from the 

NAVAL POSTGRADUATE SCHOOL 
June 1994 





Craig W. Rasmussen 
Assistant Professor of Mathematics 


Sinphohg Lawphon£pcmich 
Associate Professor of Operations Research 



Yutaka Kanayama 
Professor of Computer Science 



Richard Franke 
Professor of Mathematics 


Approved by: 
Approved by: 


Harold M. Fredricksen 
Professor of Mathematics 
Dissertation Supervisor 



Richard S. Elster, Dean of Instruction 


ii 


















Abstract 


A binary de Bruijn sequence has the property that every n-tuple is distinct 
on a given period of length 2 n . An efficient algorithm to generate a class of classi¬ 
cal de Bruijn sequences is given based upon the distance between cycles within the 
Good - de Bruijn digraph. The de Bruijn property on binary sequences is shown 
to be a randomness property of the ZERO and ONE run sequences. Utilizing this 
randomness we find additional new structure in de Bruijn sequences. We analyze 
binary sequences that are not de Bruijn but instead possess the sufficient structure 
so that every distinct binary n-tuple can be systematically “combed” out of the se¬ 
quence. These complete or nonclassical de Bruijn sequences are a generalization of 
the well-known de Bruijn cycle. 

Our investigation focuses on binary sequences, called double Eulerian cycles , 
that define a cycle along a graph (digraph) visiting each edge (arc) exactly twice. A 
new algorithm to generate a class of double Eulerian cycles on graphs and digraphs 
is found. Double Eulerian cycles along the binary Good - de Bruijn digraph are 
partitioned by the run structure of their defining sequences. This partition allows for 
a statistical analysis to determine the relative size of the set of complete cycles defined 
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by the sequences we study. A measure that categorizes double Eulerian cycles along 
graphs (digraphs) by the distance between the two visitations of each edge (arc) is 
provided. An algorithm to generate double Eulerian cycles of minimum measure is 
given. 
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Table of S> tools and Notation 


This section consists of a list of frequently used mathematical symbols and notation. 


SETS 


INTEGERS 


SYMBOL 

MEANING 

x € 5 

x is a member of 5 

x £ S 

x is a not a member of S 

ScT 

S is a proper subset of T 

SCT 

5 is a subset of T 

0 

The empty set 

AnB 

Intersection of sets A and B 

AuB 

Union of sets A and B 

A-B 

Difference of sets A and B 
(relative complement of B in A) 

\A\ 

Cardinality of A 

V x T 

Cartesian product of sets V and T 

(si,...,s n ) 

n- tuple 

{sj, • • • i Sn} 

List of elements in a set 

Z + 

Set of positive integers 

a\b 

a divides b 

a mod b 

Remainder when o is divided by b 

a = b mod m 

a is congruent to b modulo m 
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FUNCTIONS 


[xj 

The greatest integer less than or equal to x 

M 

The least integer greater than or equal to x 

4> 

Euler’s totient function 

max(x, y) 

Maximum of x and y 

min (x,y) 

Minimum of x and y 

COUNTING 


PM 

Number of r-permutations of a set with n elements 

C(n,r) 

Number of r-combinations of a set with n elements 

C(n;m,n 3 ,...,n m ) 

Multinomial coefficient ( n ) = ,- ; 

CO 

Binomial coefficient 

GRAPHS AND DIGRAPHS 


(x>y) 

Directed edge or arc 

xy 

Undirected edge 

\\G\\ P 

Number of distinct walks in the graph G 
that traverse each edge exactly p times 

d c {x,y) 

Distance from vertex x to vertex y on a cycle 

do{x,y) 

Directed distance from vertex x to vertex y 

d e (x,e) 

Edge-distance from vertex x to edge e 

d{x,y) 

Distance from vertex x to vertex y 

d(x,V) 

Distance from vertex x to the set of vertices V 
where d(x, V) = min^v d(x,v) 

d e (x,E) 

Edge-distance from vertex x to the set of edges E 
where d e (x, E) = minegE d e (x, e) 
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EM(G) 

s v 

es v 

cr-e dges 

out(x) 

in(x) 

Pv{F) 

M f (D) 

B n 

%) 

5n-(fc+l) 

(a) 

(a). 

y(«)v 

(«)* 

W 7 

W u 

W 2 . 

G-{V,E) 
D = (V,A) 


Median of the graph G 
Edge-median of the graph G 
Status of vertex v 
Edge-status of vertex v 

Single-edges, i.e., edges that have been currently 
traversed once during a walk along a graph 
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Out degree of vertex x 

Indegree of vertex x 
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The set of double Eulerian walks beginning 
at vertex v 
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log 

TTIR 


»W 


Binary n-tuple 
Conjugate of x 
Conjugate of x 
Complement of x 
Addition modulo 2 

Feedback shift register of span n 

Pure cycle register of span n 

The set of de Bruijn sequences of length 2 n 

The set of balanced binary sequences of length 2 n 

The set of binary sequences of length 2 " with 
the run property 

A subsequence of consecutive 0 ’s that is 
preceeded and followed by a 1 . 

A subsequence of consecutive l’s that is 
preceeded and followed by a 0 . 

A sequence of n-tuples extracted from a 
2 "-long sequence, 5, by an (/,n)-comb, C 

The repetition number of the element k 
in the multiset D n 

Logarithm to base 2 

Abbreviation for: to the immediate right 

Assignment operator 

Depiction of a (7,4)-comb specified by 
s t+l» ”•) 5 t'+s) ~ 5 i+2> 5 *+s) 


XIV 


ACKNO WLED GEMENTS 


As in any multi-year project, there are many people to thank. I would like 
to thank the members of the staff and faculty in the Department of Mathematics at 
the Naval Postgraduate School who helped me often on a daily basis. Craig Ras¬ 
mussen, Van Hensen, and David Canright introduced me to LfrTjrX and answered a 
myriad of technical questions. Thanks go to the members of my committee; they are 
Hal Fredricksen, Craig Rasmussen, Richard Franke, Siriphong Lawphongpanich, and 
Yutaka Kanayama. Each, in their own way, provided encouragement and guidance 
during all phases of the dissertation. 

I would also like to acknowledge the support of Craig Rasmussen during the 
past three years. He taught me so much and provided many helpful suggestions 
throughout the preparation of this dissertation. Without his critical reading this 
documentation would be less than it now is, both in content and in style. His dedi¬ 
cation served me well. More importantly, he became a good friend. 

I am forever grateful to my advisor Professor Hal Fredricksen, for without him 
there would be no dissertation. He showed me the great beauty of mathematics, a 
vision I will keep for the rest of my life. Our friendship has continued for nearly ten 
years and has been an important part of my life. He always reminded me of the most 


xv 











r 


important issues in life, teaching me things not found in books. His confidence in this 
work always exceeded my own. 

Finally, I want to acknowledge my wife and best friend, Paula. She cared for 
my well-being and has given me so much. Without her this endeavor would never 
have been accomplished. 


xvi 













I. PERSPECTIVE 

“Begin at the beginning, ” the King said, very gravely, “and go on till you come 
to the end: then stop.” 

Alice’s Adventures in Wonderland, Chapter 12 

A. OVERVIEW 

The advent of modern high-speed communication hardware creates a need for 
high-speed techniques to generate random-like sequences. Most digital computers and 
many communication systems handle information in binary form. One of the sim¬ 
plest and most efficient devices for generating deterministic, random looking binary 
sequences is the shift register. Every periodic binary sequence is obtainable from 
some suitably constructed shift register. This generality allows great versatility in 
shift register applications. The applications for shift register sequences include secure 
data transmission [Ref. 1], robot path planning [Ref. 2], multiple address coding 
[Ref. 3], error correcting codes [Ref. 4], radar range measuring [Ref. 3], and random 
number generation [Ref. 5). 

It is well-known that a (2 n — 1 )-long sequence , i.e., a binary sequence of length 
2 n — 1, containing all the non-zero binary n-tuples can be obtained from an n-span 
shift register by means of a feedback function consisting entirely of modulo 2 additions 
[Ref. 6]. Such functions are called linear. A great deal of theory has been developed 
about linear feedback shift registers. Finite fields provide the underlying mathemat- 
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ical foundation for the linear feedback shift register. Once we remove the restriction 
that the feedback function be linear, allowing products of variables, the fundamen¬ 
tal structure of the shift rigister changes dramatically. As we expand from linear 
to nonlinear feedback functions the number of maximum length binary shift register 
sequences of degree n, 2 n -long sequences with distinct n-tuples, increases from less 
than to exactly 2 2 „ . Unlike in the linear case, for a general nonlinear feedback 
function it is often true that the best way to determine the resulting sequence is to 
exhaustively construct the state tree, since there are few algebraic approaches for 
the nonlinear analysis. Many fundamental questions remain to be answered for the 
nonlinear problem. There is a need for expanded techniques and tools. 

In a sense, no finite length sequence is ever truly random. In particular, no 
sequence that depends on a rather small number of parameters, such as the feedback 
connections of a feedback shift register, can be considered truly random. These 
sequences, however, have the balance and run randomness properties expected of 
random sequences as defined by Golomb [Ref. 6]. We consider only sequences of 
finite length. Furthermore, the end of the sequence is considered to be contiguous 
with the beginning of the sequence, hence, sequences and cycles can be considered to 
be equivalent. 

B. CURRENT RESEARCH 

An important problem currently under consideration with respect to nonlinear 
sequences is that of finding a broad class of functions that yield near-maximal length 
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cycles. Ford [Ref. 7], Lempel [Ref. 8], and Fredricksen [Ref. 9], among others, have 
provided methods for constructing de Bruijn cycles and others have expanded upon 
their ideas. Additionally, classes of functions that yield pairs of cycles of length 2 n_1 
are examined by Kibler [Ref. lOj. But here, as in so many other facets of the analysis 
of nonlinear functions, even though the structure suggests some underlying means of 
classifying these functions, the discriminating factor has proven to be elusive. 

A de Bruijn cycle (or de Bruijn sequence) of length 2 n has the property that ev¬ 
ery n-tuple appears exactly once on a given period. For some applications it might not 
be necessary that the n bits of interest lie consecutively along the sequence. Our work 
examines an apparently unexplored area with respect to de Bruijn cycles. Although 
novel, this research has parallels to the current direction of study to find classes of 
de Bruijn cycles. We analyze sequences of length 2 n that are not de Bruijn but 
that have been found to possess sufficient structure so that every distinct binary 
n-tuple can be systematically “combed” out of the sequence - i.e., every n-tuple can 
be found appearing at the successive positions in a subsequence. We find that in this 
way the properties that make de Bruijn cycles so attractive can be extracted from 
sequences that apparently do not contain each n-tuple. 

C. DESCRIPTION OP THE THESIS 

A simple method to extract n-tuples from a sequence is to cycle the sequence 
through the pure cycling register of length 2 n (PCRi*) [Ref. 6], recording the bits of n 
arbitrary (not necessarily consecutive) but fixed registers at each shift of the PCRi •». 
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More precisely, we define a functional F{fu / 2 , • • •, / 2 ")< whose domain consists of the 
2 n functions The function /, projects the / bits ($,, s,+i,... 

l > n, from a 2 n -long binary sequence ($i, $ 2 ,..., s 2 ") to a binary n-tuple. We restrict 
each of the functions fi in T to be the projection from B 1 to B n defined by 

^i+lt • • • i — ('®*»1 • • • i &in-2 > •*«+/— 1 )> 

where i < j\ < ■ • • < j „_2 < i + / — 1. 

It is useful to visualize each /, as an I-long window containing n viewing 
stations or as a comb of length l with n surviving teeth, denoted as an ( >,n)-comb. 
It should be noted that an (/, n)-comb is always to be applied to a sequence of length 
2 n since our intent is to extract 2 n unique n-tuples. 

The set of all 2 n -long balanced sequences, i.e., those with an equal number of 
l’s and 0’s, is denoted by «S„. If S = 0001010001101111, then 5 is an element of 
$4 that is clearly not de Bruijn, i.e., 4-tuples are repeated. Hence, the (4,4)-comb 
consisting of 4 consecutive teeth is not a comb for 5. The (8,4)-comb specified by 

^«+l> ^«+2t ^«'+3 j ^»+4? ^«+5i S»+6 i <Sj+ 7 ) = (®i» >®*-H» $t+2» 

however, generates the following sequence of distinct 4-tuples from S: (0000, 0010, 

0101 , ion, oioo, iooi, oooi, ooii, oin, lioo, ioio, ono, mi, mo, 1101 , 1000 ). 

It is convenient to depict combs by a symbol that portrays the teeth of the 
comb in a fairly obvious way. For example, the (8,4)-comb specified above by 

$1+1,3|-f2, Sj-f3, $i+4i ^i+5i £,+6, Sj+7) = (Sj, ^«+l, ^»+7) 
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is represented by the symbol ll1 **** 1 . It can easily be verified that 5 has the following 
(/,4)-combs: l|wH . I ‘ |,, **l***' . ( Note that these combs are 

distinct combs and are not cyclic shifts of each other.) 

We see that a sequence can have several associated combs. Some sequences, 
however, have exactly one comb while other sequences have no combs at all. Clearly, 
a sequence of length 2 n composed of a de Bruijn cycle of length 2 n_l concatenated 
with itself cannot have a comb since, for each i, 

f(Si,Si + 1 , . . . , Si+/_i) = /(.Si+jn-i»•Si+1+2"- 1 > • • • , Si+|-l+2»“‘ ). 

For the same reason any sequence of length 2 n possessing a periodic part of period d, 
where d is a proper divisor of 2 n , cannot have a comb. 

Fundamentally, the search for sequences and combs can proceed in either of 
two ways: either by finding a sequence satisfying a particular comb or by finding a 
comb that satisfies a particular sequence. In either case, the appropriate sequences 
are necessarily of length 2” and are therefore generated by polynomials, over the field 
of two elements, of the form fk{x) = (x + 1)* for some 2 n_1 < k < 2 n [Ref. 6]. Only 
those sequences from this set that are balanced need be considered. 

Occasionally, we categorize an (/, n)-comb as an m-comb if at most m of its n 
teeth are consecutive. We establish a one-to-one correspondence between sequences 
and walks along a graph or digraph. We show that a sequence of length 2 n with an 
(n — k) comb, 1 < k < n, traverses each arc in the digraph Z? n _(fc+i) exactly 2 fe times. 
(The digraph ~B n -(k+ 1 ) is isomorphic to the Good - de Bruijn digraph B n ,(k+i}-) This 
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property has far-reaching implica ms. We find that walks traversing each arc of the 
Good - de Bruijn digraph provide a common thread throughout each of the following 
chapters. The set of walks that visit each edge (arc) in a graph (digraph) exactly 7 
times is denoted by W 7 . 

The primary goal of the research presented here is to provide a better under¬ 
standing of the de Bruijn property of distinct n-tuples, binary de Bruijn cycles, and 
the Good - de Bruijn digraph. The thesis consists of 7 chapters: 

Chapter II: The purpose of this chapter is to make the thesis self-contained. 

We provide a concise introduction to graph theory, feedback shift registers, and 
their resulting sequences. This introduction is suited for the analysis of double 
Eulerian cycles, binary de Bruijn cycles, and the Good - de Bruijn digraph. 
Apart from basic definitions, Chapter II also introduces definitions that are not 
in common use but proved to be convenient in the nonlinear theory of periodic 
sequences. We introduce some initial analysis on double Eulerian cycles , cycles 
that traverse each edge (arc) along a graph (digraph) exactly twice. 

Chapter III: We investigate sequences called complete cycles or nonclassical 
de Bruijn cycles. A complete cycle of length 2 n has the property that each of 
the possible 2 n binary n-tuples lies along a fixed pattern or (/, n)-comb of the 
sequence. The analysis of these complete cycles is primarily concerned with 
combs where n — 1 of the bits of interest lie consecutively along the sequence. A 
characterization of a class of complete cycles is made in terms of the walk they 








define along an appropriate Good - de Bruijn digraph. A statistical analysis 
is made to determine the number of complete cycles defined by a particular 
class of sequences. 

Chapter IV: To analyze (n—l)-combs, i.e., those with n -1 consecutive teeth, 
some essential theory on double Eulerian cycles along graphs and digraphs is 
developed. We define a measure that categorizes double Eulerian cycles by 
a function of the two visitations along each edge in the traversed graph. In 
essence, the measure describes where particular n-tuples are located on the 
sequence in some rough sense. This measure parallels the discrete logarithm 
problem of finite fields [Ref. 11] that forms the foundation of some current pub¬ 
lic key cryptography systems. A conjecture is given for the minimum measure 
(or value) of a Good - de Bruijn digraph. 

Chapter V: We describe how the de Bruijn property of distinct n-tuples re¬ 
sults from a randomness property of the run lengths in a binary sequence. The 
run structure of the sequences defining an Eulerian or double Eulerian cycle 
along the Good - de Bruijn digraph is completely determined. A statistical 
analysis shows that the property of containing distinct n-tuples in a binary de 
Bruijn cycle of length 2” is equivalent to the Expected Value Property for run 
lengths in a random binary sequence. 
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Chapter VI: The concepts derived in Chapter IV are used to develop a new 


algorithm to generate classical de Bruijn cycles. The algorithm is based on 
the edge-factors of the Good - de Bruijn digraph and the distance between 
cycles. The edge-factors of a graph are found to be an important element to 
determine the minimum measure of a graph or digraph. 

Chapter VII: Here we summarize the results of the thesis. Directions for 
further research are identified and open problems are discussed. 

In summary, the ensuing chapters emerge from the development of combing 
sequences in Chapter III. More importantly, each result is inspired by the desire to 
gain a greater understanding of nonlinear binary sequences in general. 

D. HISTORICAL NOTE 

PROBLEM: Given m symbols (which, without toss of generality, we take to be 
0, 1, , m-1) and a positive integer n, find a sequence of these symbols having 

minimum length, that when arranged as a cycle, contains every sequence of n 
consecutive symbols. 

A solution to this problem is a de Bruijn cycle of length (m) n . In 1951 van 
Aardenne-Ehrenfest and de Bruijn (Ref. 12] showed that de Bruijn cycles exist for 
all m > 2 and n > 1. In fact the de Bruijn cycle problem has been independently 
rediscovered many times. 


8 












f 


The problem of finding de Bruijn cycles for m = 2 became well-known through 
de Bruijn’s paper [Ref. 13], where the number of solutions was found to be 2 2n ~ l ~ n . 
de Bruijn [Ref. 14], however, credits Stanley for discovering that the problem had been 
proposed and solved half a century earlier in the French problem journal I’Intermediaire 
des Mathematiciens in 1894. The problem was proposed by de Riviere in 1894 and 
solved by FI) 3 Sainte-Marie [Ref. 15] that same year. Flye Sainte-Marie found the 
same number, 2 2 " _I-n , and his method of solution was paralleled by de Bruijn. Three 
years later, Mantel [Ref. 16] found a solution whenever m is prime using an algebraic 
method. After 1897, the problem was apparently entirely forgotten until 1934, when 
it was reintroduced by Martin [Ref. 17]. Martin approached the problem combina- 
torially and proved the existence of de Bruijn cycles for all m and n by creating an 
algorithm to construct such cycles. A decade after Martin, de Bruijn [Ref. 13] and 
Good [Ref. 18] independently rediscovered and solved the problem for the case m = 2 
using graph theoretic and group theoretic concepts. 

The corresponding problem for m > 2 symbols was first raised and solved 
in 1951 [Ref. 12]. The number of solutions was found to be using 

methods of determinants on the adjacency matrix representing the Good - de Bruijn 
digraph. 

Subsequently, algorithms to generate some or all of the de Bruijn cycles of 
length 2 n have been repeatedly uncovered. It is a palatial problem that will undoubt¬ 
edly lure attempts to unravel it again and again. 
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II 


DEFINITIONS AND NOTATION 


“...But do cats eat bats, / wonder?” And here Alice began to get rather 
sleepy, and went on saying to herself, in a dreamy sort of way, “Do cats eat 
bats? Do cats eat bats?” and sometimes “Do bats eat cats?” for you see, as 
she couldn’t answer either question, it didn’t much matter which way she put 
it. 


Alice’s adventures in Wonderland, Chapter 1 


A. INTRODUCTION 

Before describing double Eulerian cycles, de Bruijn cycles, and the methods 
and techniques we need to address in our work, it is necessary to provide applicable 
definitions and notation. A detailed description of linear and nonlinear shift register 
sequences appears in Golomb [Ref. 6]. A text on graph theory such as Bondy and 
Murty [Ref. 19], provides a thorough discussion on the relevant material on graph 
theory. The reader who is thoroughly familiar with the vocabulary and concepts of 
graph theory and de Bruijn cycles may wish to skip most of this chapter. 

B. GRAPHS 

A graph G consists of a set V = {ui,U 2 > • • • ,v P }> of elements called vertices 
(or nodes) and a set E — {ei,e 2 ,...,e,} of unordered pairs of vertices called edges. 
The edge between vertex x and vertex y is written as xy or yx. The graph G is said 
to have order p and size q. We write G = (V, E) and say V is the vertex set and E 
is the edge set. 
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Edges of the form e = u,u, are called loops. A graph without loops is called 
simple if it also has no pair of vertices forming more than one edge. A graph is finite 
if its order is finite. We will consider only graphs that axe finite, but not necessarily 
simple. 

Edge e = V{Vj is said to be incident to the vertices v, and v y Similarly, v, and 
Vj are incident to the edge e. Two vertices that are incident to a common edge are 
adjacent , as are two edges that are incident to a common vertex. A set of vertices 
(edges) is independent if the vertices (edges) are mutually nonadjacent. The graph 
G' ~ G — {v} is the graph G with the vertex v and its incident edges removed. 

A subgraph of a graph G = (V, E) is a graph H = (V 1 , E') where V C V and 
E' C E. Suppose that V' is a nonempty subset of V. The subgraph of G whose vertex 
set is V' and whose edge set is all of those edges of G for which both incident vertices 
are in V' is called the subgraph of G induced by V'. Suppose that E' is a nonempty 
subset of E. The subgraph of G whose vertex set is the set of all vertices incident to 
the edges of E' and whose edge set is E' is called the subgraph of G induced by E'. 

A walk W = (vj,ei,V 2 ,ej,...,e*-i,Ufc) in G is an alternating sequence of 
vertices and incident edges, beginning and ending on a vertex of G. A walk may 
have repeated edges and repeated vertices. The length of a walk is the number of 
edges in the sequence. A walk can also be identified by merely listing the sequence 
of adjacent edges or, in a graph without multiple edges, by listing the sequence of 
adjacent vertices. 
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Graph 
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Figure 1. Graphs and a simple graph 

A walk is said to be closed if the initial and terminal vertices are the same. 

A closed walk is called a cycle. When it is not important to designate the initial 
vertex of a cycle, the terminal vertex will typically not be listed in the sequence of 
adjacent vertices. Using this notation, any cyclic shift of a cycle is considered to be 
the same cycle. Further, this allows a distance to be defined between the vertices 
(edges) of a simple cycle. The distance between the vertices u, and Vj on a simple 
cycle C = (vo, t>t,..., u p _i) is denoted 

d c (vi, vj) = min ((* - j) mod p, (j - i) mod p). 

Similarly, the distance between the edges e, and ej on a simple cycle C = (eo, ej,..., e p _j) 
is 

d c (e„ ej) — min ((*' - j) mod p, (j - i) mod p). 

A walk with distinct edges is called a trail. If the vertices of a trail are distinct, 
the walk is called a path. A closed path is called a simple cycle. If 

(ui,ei,v 2 ,e 2 ,...,e*_i,t; fc ) 
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G : v i 


Figure 2. Cycles 

is a walk, v\ is the initial vertex, the set of interior vertices is {u 2 , V3,..., Vfe_ 2 , 
and v/t is the terminal vertex of the walk. That is, all of the vertices of the walk 
that occur somewhere other than as the initial and terminal vertices of the walk are 
interior. The set of all walks along a graph beginning at vertex v is denoted by VV„. 
The set of all walks in which each of the edges appears exactly 7 times is denoted by 
W 7 . Additionally, the set of all walks in W„ where each of the edges appears exactly 
twice is denoted by W 2 .. 

A cycle C of period p contains p edges and we write \C\ = p. If C is a cycle, 
but not a simple cycle, then C is said to be reducible. Evidently, a cycle is reducible 
if it is not a closed path. The closed path (ui,n 2 ,U 3 ,U 4 ,U 5 ,t> 6 ) in Figure 2 is both a 
cycle and a simple cycle. The closed trail C = (u 2 , v 6 , v 3 , v s , v 6 , i>i) is a cycle but not 
a simple cycle, i.e., C is reducible. The cycle C can be reduced to the two simple 
cycles (v 2 ,ve,vi) and (v e ,v 3 ,vs). 

The degree of a vertex u in a graph, denoted deg(u), is the number of edges 
incident to v. A loop at a vertex contributes two to the degree of that vertex. 
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A graph G is said to be connected if, for all i,j, there exists a path connecting 
Vi and v r A graph that is not connected is said to be disconnected. 

A trail in a graph G is called Eulerian provided it contains every edge of G 
exactly once. An Eulerian cycle in G is a cycle that includes each edge of G exactly 
once. A walk in a graph G is called double Eulerian if it contains every edge in G 
exactly twice. A path in a graph G is called Hamiltonian provided that it contains 
every vertex of G exactly once. A Hamiltonian cycle in G is a cycle that includes 
each vertex of G exactly once. 

The following theorem is a well-known result in graph theory. See, for example, 
Bondy and Murty [Ref. 19]. 

Theorem II. 1 A connected graph G has a closed Eulerian trail if and only if 
the degree of each vertex is even. 

A partition of a set X is a family {X,|t € / C Z + } of non-empty subsets of X 

such that X = [J Xi and the family is pairwise disjoint. 

«€/ 

A factor of a connected graph G = ( V , E) is a set of cycles in G that induces a 
partition of V. In a similar manner, an edge-factor of a connected graph G is a set of 
cycles in G that induces a partition of E. The set of cycles {(uj, v 2 , ue)» (*> 3 , t> 4 , U 5 )} is a 
factor of the graph G in Figure 2. The set of cycles {(vi, v 2 , 1 %), (t> 3 , v 4 , v s ), (v 2 , v s , v 6 , u 3 )} 
is an edge-factor of G. 
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Figure 3. A graph with a reducible cycle 

The subgraph induced by the edges of a reducible cycle has a nontrivial edge- 
factor. For example, let the edge-factor F of G in Figure 3 consist of the cycles 
Ci = (v 2 ,v 6 ,v 3 ,vi,v 6 ,vi) and C 2 = (u 2 ,u 3 , The subgraph H of G in Figure 3 

is induced by the edges of C\. Since C\ is reducible, H has a nontrivial edge-factor con¬ 
sisting of the cycles A = (t> 2 , v 6i t>i) and B = (ve, v 3 , v 5 ). The set of cycles {C 2 , A , B} 
is an edge-factor of G, denoted by Fa,b- 

A tree is a connected acyclic graph, i.e., the path connecting any two vertices 
is unique. Any vertex in a tree can be distinguished as the root of the tree. Every 
path from the root has a last or terminal vertex. The height of a rooted tree is the 
length of the longest path from the root. 
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Two graphs G\ = {V\,E\) and G 2 = (Y 2 ,E 2 ) are isomorphic if there is a 
bijection / from V x to V 2 such that f(x)f{y) is an edge of G 2 if and only if xy is an 
edge of G x . 

The distance, d(x,y), between two vertices x and y in a connected graph G is 
the minimum length of a path joining them. The edge-distance, d t (v,e), between a 
vertex v and an edge e = xy in G is 

d e (v, e) = min (d(t>, x), d(v, y)). 

Given a subset X C V of vertices in G = ( V, E), the distance between a vertex v € V 
and X is 

d(v , X) = mind(v, x). (II.l) 

Similarly, given a subset Y C E of edges, the edge-distance from v to Y is 

d e (v,Y) = nund e (v,y). (II. 2 ) 

Let G = (V, E) be a connected graph. The status , s v , of a vertex v in G is 
defined by s v = ^ d(v,n). The median M(G) of a graph G is the set of vertices 

n€V 

of minimum status. In a similar manner, the edge-status, es v , of a vertex v 6 G is 

defined as es v = d 9 (v,e). The edge-median EM(G) of a graph G is the set of 

e€£ 

vertices of minimum edge-status. 

For example, in Figure 4 for the graph G we find s v , = 9, s*, = 8 , es Vl = 4, 
esv, = 4, M(G) = {u a }, and EM(G) = {vi,^}- 
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Figure 4. A graph for which M(G) ^ EM{G) 



Figure 5. Digraphs 


C. DIGRAPHS 

A digraph (or directed graph) D = (V, A) has a set V of elements called vertices 
and a set A C V x V of ordered pairs of (not necessarily distinct) vertices called arcs. 
We think of the arc a = (x, y) as leaving x and entering y, that is, directed from x to 
y; x is the initial vertex of a and y its terminal vertex. A digraph may contain each 
of the arcs (z,y) and (y,x) as well as loops of the form (x,z). A loop (z, x) enters 
and exits the same vertex x. A general digraph may also include multiple arcs. 
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A vertex of the digraph D has two associated degrees. The outdegree of a 
vertex v, denoted out(u), is the number of arcs for which v is the initial vertex. The 
indegree of v, denoted in(t>), is the number of arcs for which v is the terminal vertex. 
The loop (x,x) contributes 1 to each of in(x) and out(x). 

The definitions of walk, path, trail, factor, and cycle carry over from graphs to 
digraphs in a fairly obvious way. For example, a directed walk in a digraph D = (KA) 
is a sequence of vertices and arcs, (vi,cti,V 2 ,with the property that 
(vj, € A for 1 < i < k — 1. A directed walk is a directed path if ail of its vertices 
are distinct, a directed trail if all its arcs are distinct, and a closed directed walk if the 
initial and terminal vertices are the same. A closed directed walk is a cycle-, a closed 
directed path is a simple cycle. 

For any graph G = (V, E) we obtain a digraph D = (V, A) by giving each edge 
xy € E an orientation, that is, by replacing xy with either (x, y) or (y, x). Such a 
digraph D is called an orientation of G. A graph has 2^' orientations if it has no 
loops and 2^~ k orientations if G has k loops. Conversely, given a digraph D = (V, A) 
we can remove the direction of its arcs thereby obtaining a graph G = (V, E). Such a 
graph is called the underlying graph of G. A digraph D has exactly one underlying 
graph, denoted by Go- 

A weak digraph is one that has a connected underlying graph. A digraph 
D = (V, A) is said to be strong provided that for each pair of distinct vertices x, y € V 
there is a directed walk from x to y and a directed walk from y to x (i.e., there is a 
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Figure 6. An illustration of an edge-factor of a directed yraph 

directed cycle that includes y and a;). A directed trail in a digraph D = (V, A) is called 
Eulerian provided that it contains every arc of A exactly once. A directed path is said 
to be Hamiltonian if it contains every vertex of V exactly once. A Hamiltonian cycle 
is one that contains every vertex of V exactly once, where the initial and terminal 
vertices may be considered the same. 

A factor of a weak digraph D = (V, A) is a set of cycles in D that induce a 
partition of V. Similarly, an edge-factor of a weak digraph D is a set of cycles in D 
that induce a partition of A. 

As an example, the set of cycles {(v t ), (v 3 , V3), ( 04 )} in Figure 6 is a factor 
of graph ffj. The set of cycles F = {(ui), (ui, t/j, U3), (t/ 2 , V4, 1/3), (t/ 4 )} constitute an 
edge-factor of B%. 

For vertices x and y in a strong digraph D, the directed distance dp(x,y) is 
the length of a shortest directed path from x to y in D. The directed distance from 
any vertex to itself is zero, i.e., do(x,x) = 0. Unless the digraph D is symmetric, it 
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is not generally the case that do(x,y) = dp(y,x) for all vertices x,y in V. Therefore, 
directed distance is not in general a metric. The distance, d(x, y ), between two vertices 
x and y in a weak directed D digraph is the minimum length of a path joining them 
in the underlying graph of D. 

Given an edge-factor F of a digraph D = {V, A), the posture, p v (F), of a vertex 
v 6 V is defined by 

Pv(F)= ^2\C\d e (v,C), (II.3) 

C€F 

where \C\ is the number of edges in the cycle C, d e (v,C) = mind e (u,x) in Go, the 
underlying graph of D, and the summation is over all of the cycles of the edge-factor. 

The mean for the factor F of a directed graph £>, denoted by Mp(D), is defined 
to be the set of vertices with minimum posture for the edge-factor F. As an example, 
for the edge-factor F of B? we find p v ,(F) = 10, p^F) = 4, and M F (B 2 ) = {v 2 , u 3 ). 

D. GOOD - DE BRUIJN GRAPHS AND DE BRUIJN 
CYCLES 

The original formulation of the de Bruijn cycle problem for m = 2 can be 
viewed as finding the number of Hamiltonian paths in an appropriate directed graph. 
This graphical interpretation is very useful in understanding the properties of 
de Bruijn cycles. 

A binary feedback shift register of span n (FSR„) is a collection of n storage 
devices (i 0 , Zi,..., z n _j), each capable of holding an element of B = {0,1}, together 
with a feedback function /(zq, xi,. . . taking on a value of 0 or 1, computed 
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from the contents of the n storage devices. The contents of the register at time t, 
regarded as a binary n-tuple or a binary vector, are called the state of the register. A 
FSR„ has 2" possible states, namely the elements of the set B n of all binary n-tuples. 
The feedback function /(x) of the FSRn, where x = (x 0 , x lt ..., z n -i), induces a 
mapping F: B n -+ B n , where F(x) = y if and only if 

{ 3-i+i! i = 0,..., n 2 

}{x ); i = n - 1. 

At the beginning of each time interval, determined by an external clock, there is a 
transition from one state to the next. 

The superposition of all possible state transition graphs for each positive in¬ 
teger n defines the binary Good - de Bmijn digraph of order n, denoted by B n . See 
Figure 7 for some examples of de Bruijn graphs. Thus, the Good - de Bruijn di¬ 
graph B n is a directed graph with 2" vertices, each labeled with a unique binary 
vector of length n, and an arc (x,y) from vertex x = (z 0 , xi,... to vertex 

y = (yo,yi,-..,yn-i) if and only if (x^xj,...^,,-!) = (yb,yi,... ,y n - 2 ). We call y a 
successor of x and x a predecessor of y. If (x,y) € A, x is adjacent to y in B n . The 
conjugate x of the n-tuple x = (xo,xj,...,x n _i) is defined as x = (xo,xi,...,x^), 
where x, = x, © 1 and © denotes addition modulo 2. The companion x' of the n-tuple 
x is defined asx / = (xo,xi,... ,x n _i). 

Each arc in B n can be viewed as an element of the (n + l)-dimensional binary 
vector space B n+l . The arc between the vertex (x<),...,x n _i) and vertex (xi,...,x n ) 


21 






in £ n can be labeled by the (n + l)-tuple (x 0 ,Xi,-. .,x n . u x n ). The total number 
of arcs in B n is 2 n+1 , and each (n + l)-tuple is assigned to a unique arc. With this 
labeling we can inductively construct the digraph fi n+l . We include an arc from 
x € B n+1 to y € £ n+1 in this new graph B n +i whenever the terminal vertex of the 
arc x € B n is the same as the initial vertex of the arc y € B n . We see that a vertex 
x is adjacent to a vertex y in the induced digraph when the last n coordinates of 
x match the first n coordinates of y in the respective (n + l)-tuples representing x 
and y. This induced digraph is isomorphic to fi n+1 by the identity mapping. Thus, 
a closed Eulerian trail of length 2 n+1 visiting every arc in B n defines a Hamiltonian 
cycle visiting every vertex in £ n+1 . Since (0,1) is a Hamiltonian cycle in B u and 
there exists a Eulerian trail in £ n , n > 1 , we conclude that a Hamiltonian cycle exists 
in B n for n > 1. 

Another way to label the arcs in the digraph B n is to simply label the arc going 
from the vertex (x 0 , • •., x n -i) to the vertex (*j,...,x n ) with the single bit x n . The 
set of labels encountered as we trace the arcs along a Hamiltonian cycle (path) in Bn 
generates a binary de Bruijn cycle (sequence). Such a cycle is periodic of period 2 n 
and contains each of the 2 n different binary n-tuples exactly one time in each period 
of the sequence. The set of all de Bruijn cycles of length 2" is denoted as S n . 

A subsequence of n consecutive terms (or an n-sequence) from a sequence 
S = (si,Sj,..•,Sj«) is a string of the form (s„s f+ i,...,s, + „_t), where we use the 
convention that the subscripts are taken modulo 2 n . That is, we allow an n-sequence 
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Figure 7. Good - de Bruijn graphs B n , 1 < n 
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of the form 


{$2 n -j » ^2 n -j+l j — l)i j 0, 1,. . . , Tl 2, 

which “wraps around” from the end of S to the beginning. Any circular permutation 
of 5 is considered to be the same sequence. 

A cycle of period p in B n is an ordered set of distinct vertices (states) 
(i>Oi Vy ,... such that i>i + i is the successor of u„ i.e., v, + i = F(vi), 

i = 0,1,... ,p — 2, and t>o = F(v p - i) . Clearly, an edge-factor in B n is isomorphic to 
a factor in B n +i • 

In addition to having two successors, each vertex in B n has exactly two pre¬ 
decessors. Moreover, if vertex x is a predecessor of vertex y, the following is always 
true: 

1 . x is a predecessor of y. 

2 . x is a predecessor of y'. 

3. x is a predecessor of y'. 

The four vertices x, x, y, and y' are commonly called an adjacency quadruple. An arc 
(x,y) is directed from vertex x to vertex y and called incident with both x and y. 
Conversely, x and y are incident to the arc (x,y). A vertex x = (x 0 ,xi,... ,x n -i) in 
B n is incident to the arcs (0,xo,xi,... ,x n _j), (l,x 0 ,xi,... ,x„_x), (x 0 ,xi,.. .,x n _i,0), 
and (xo,Xi,...,x n _i,l) that constitute an adjacency quadruple in B n+ 1. 

In an edge-factor of B n , each arc uniquely identifies its cycle. The cycle to 
which the arc a belongs is denoted by (a). The set of incident vertices for a given 
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cycle (a) is denoted by (a)„. Two cycles (a) and (0) are adjacent if they are arc 
disjoint and there exists an arc 7 on (a) whose conjugate 7 is on (0) (i.e., a , 0 are 
both incident to same vertex u). It follows that (a) and (0) are adjacent whenever 
(a) and ( 0 ) are disjoint and (a)* H (0) v ± 0. Consequently, each vertex is incident to 
2 distinct cycles in an edge-factor consisting of simple cycles. 

Furthermore, let v (a) v denote the set of vertices i>, € (a) v for which 
d(y,Vi) = rf(y, (a)„). From each set y (a) v a specific vertex v t € y (a)„ is designated as 
the representative vertex of (a), denoted by u; = v (c*)r. 

Recall that the distance , d(x, y), between two vertices x and y in a weak 
digraph is the minimum length of a walk joining them in the underlying graph. From 
Equation II.l on page 16, the distance, d(y, (a) v ), between a vertex y and the set of 
vertices (a)„ in B n equals min {d(y, 1)}. 

r€(o)» 

One factor occurs so often that it has been given a special name. The Pure 

Cycling Register Factor consists of cycles formed by the cyclic rotation of the bits 

in the Pure Cycling Register, PCRn- Golomb [Ref. 6] shows that the number of 

cycles, Z(n), in the PCRn is given by Z(n) = ~Y]<t>{d) 2^, where 4> is Euler’s 

n 7r 

d\n 

totient function and the summation is over the divisors of n. See Table I for values of 
<(>(n) and Z(n). Mykkeltveit [Ref. 20] shows that no shift register (linear or nonlinear) 
of length n can generate a factor containing more than Z(n) cycles. 
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n 

4>{n) 

Z(n) 

1 

1 

2 

2 

1 

3 

3 

2 

4 

4 

2 

6 

5 

4 

8 

6 

2 

14 

7 

6 

20 

8 

4 

36 

9 

6 

60 

10 

4 

108 

11 

10 

188 

12 

4 

352 


Table I. Values of Euler’s function <f>(n) and of the cycle function Z(n), 1 < n < 12 
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III. COMBING SEQUENCES 


“/ want a clean cup, ” interrupted the Hatter. “ Let’s all move one place on. ” 
He moved on as he spoke, and the Dormouse followed him. The March Hare 
moved into the Dormouse’s place and Alice rather unwillingly took the place 
of the March Hare. The Hatter was the only one who got any advantage from 
the change. 


Alice’s Adventures in Wonderland, Chapter 7 


A. INTRODUCTION 

In Chapter I, Section C, we introduced the functional T{f\, / 2 ,..., / 2 ->), whose 
domain consists of the 2 n projections from B l to B n defined on a sequence of period 
2 n by 

Sj+l, . . . , St+i-1) = ($»» S ii » • • • » &jn~2 * ■*»+/—1 )i 

where i < j x < • • • < j n _ 2 < i + / — 1. 

The function fi can be viewed as an /-long window containing n viewing sta¬ 
tions or as comb of length / with n surviving teeth, which we call an (/,n)-comb. 
Occasionally, we categorize such a comb by its longest string of consecutive teeth. 
An m-comb denotes a comb with at most m consecutive teeth. It is also sometimes 
convenient to normalize a comb so that a longest string of consecutive teeth always 
appears on the left. For example, the (5,3)-comb, l**N . defined on a 2 3 -long sequence 

by 

•S t '+i, St+ 2 ) ^+ 3 , 5 ,^ 4 ) = (Sj, St+3, Si+ 4 ), 
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is normalized to the (6,3)-comb, ^***L specified by 


/ii($t> ^i+li ^«+2i ^«+3» ^«+4i •Si+s) — ( , S»> -Si+li ^«+5)» 

i.e., nor ^^** <i (Each are 2-combs). There also may be instances when one 

wants to use the version of the comb that has the smallest length l. 

If a sequence, S„, of length 2 n has the property that some (/,n)-comb can 
extract every different n-tuple, then 5„ is called complete or nonclassical de Bruijn. 
It follows that the reverse and complement of a complete sequence are also complete. 
By definition, every de Bruijn cycle is complete. 

B. SEQUENCES AND THEIR EXTRACTED n-TUPLES 

To illustrate these concepts, consider the sequence 5 of length 2* given by 
S — 0000111100101101. Using the classical de Bruijn (4,4)-comb denoted by the 
symbol, C = 1 , and specified by 


s »+ 1 j ^i+ 2 ) ^t+3) — Si+2> Sj+3), 

4-tuples are extracted from 5 in the following order: f\ —► 0000, / 2 —► 0001, 

fz —► 0011, /g —> 0111, fa —► 1111, fe —► 1110, fr —► 1100, fg —► 1001, fg —> 0010. 

fio 0101, fix —► 1011, fi 2 —> 0110, /13 —> 1101, /14 —* 1010, /15 —* 0100, 

/is -> 1000. The (4,4)-comb applied to 5 generates a sequence T(C,S), of 4-tuples 
that specifies a Hamiltonian cycle along the Good - de Bruijn graph B\. Figure 8 
represents this cycle. We see that the sequences, S and T(C , 5), define the same cycle 
around B n . 
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Figure 8. A Hamiltonian cycle in the Good - de Bruijn digraph B A 
The (8,4)-comb, C' — M****l , specified by 

5 I+Ji 5i+j| 5 «+3» 5 «'+4j 5 i+5i ^«+ 6 ) • s »'+7) = (<$,, 3,+j, 5,+ 2 , ^,+ 7 ), 

generates the following sequence, T(C',5), of distinct 4-tuples from S: (0001, 0000, 

0010, oin, mo, mi, noi, iooo, oon, 0100,1010, ono, noo, ion, 0101,1001). 

The sequence T(C', 5), however, does not represent a Hamiltonian cycle along the 
de Bruijn digraph B\ since there are consecutive 4-tuples in T(C',S ) that are not 
adjacent in B A . Since this (8,4)-comb has 3 consecutive teeth (i.e., it is a 3-comb), 
the sequence S must specify a walk along the graph B 3 (Figure 9A) that visits every 
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Figure 9. Good - de Bruijn digraph B 3 


vertex exactly twice. The visitation times for each vertex in this walk are shown in 
the parentheses in Figure 9B. 

Let C be an 3-comb for the sequence S. It follows immediately from de Bruijn’s 
doubling theorem [Ref. 13] that the sequence S, since it visits every vertex of iQ 3 twice, 
also induces a double Eulerian cycle along the digraph Bj. Furthermore, the sequence 
T(C,S), of 4-tuples specifies a double Eulerian cycle along B<_( 1+ i), a digraph iso¬ 
morphic to B% [See Appendix B]. These two respective cycles, with visitation times 
in parentheses, are shown in Figure 10. The sequences T(C, S) and S define identical 
walks along the isomorphic graphs B 4 _( i+1 ) and Bj, respectively. 
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Figure 10. and the Good - de Bruijn digraph B 2 


Lemma III.l Let S n be a complete binary sequence of length 2 n with an 
associated (n-k)-comb. Then S n specifies a walk that traverses each arc along 
the digraph fi n -(*+i) exactly 2 k times where (k + 1) < n. 

Proof: Since S„ is complete, each (n — fc)-tuple occurs exactly 2 k times. The 
arcs in the Good - de Bruijn digraph, 1 ), are uniquely represented by 

(n — A:)-tuples. Therefore, 5 n , specifies a walk that traverses each arc exactly 
2 k times along the digraph B n -(/t+i)- ■ 


Corollary III.2 Let 5 n be a complete binary sequence of length 2" with an 
associated (n-k)-comb, C. Then the sequence, T(C,S n ), of n-tuples extracted 
by the comb specifies a walk that traverses each arc exactly 2 k times along the 
digraph B n .(k+i) where (fc+l)<n. 
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Proof: Follows immediately from Lemma III.l and the definition of the iso¬ 


morphic digraph 2? n -(k+i)- * 

The elements of the set W 2 * for fi n -{*+i) define 2 n -long walks along the arcs of 
the graph B n -(k+i ) where each arc is visited exactly 2 k times. The set W 2 * includes 
all the sequences having an (n - fc)-comb. The following theorem of van Aardenne- 
Ehrenfest and de Bruijn [Ref. 12] allows us to count, in any Eulerian regular directed 
graph D , the number of sequences that define a walk along D where each arc is 
traversed exactly p times. Table II provides the number of sequences defining an 
Eulerian or double Eulerian cycle along B n , for 1 < n < 4. 


Theorem III.3 (van Aardenne-Ehrenfest and de Bruijn ) Let D = (V, A) 
be an Eulerian regular directed graph where in(v) = a. Then the number of 
ways, ||f?|| / >, to traverse each arc in D exactly p times is 



(HI.l) 


where <i> is the Euler’s totient function, ||Z?||i is the number of Eulerian cycles 
in D, and the summation is extended over all divisors of p. 


Theorem III.3 was proved using an interesting but considerably complicated 
argument. The development and proof of the theorem is summarized in Appendix A. 

Let 5 be a binary sequence of length 2” with an (n— l)-comb. By Lemma III.l, 
5 is an element in the set of sequences, W 2 , that specify a double Eulerian cycle along 
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n 

Eulerian 

Double Eulerian 

1 

1 

5 

2 

2 

82 

3 

16 

52,496 

4 

2048 

44,079,843,328 


Table II. The number of Eulerian and double Eulerian cycles along B n , 1 < n < 4 

B n - 2 - From Theorem III.3, the number of distinct double Eulerian cycles along fl„_ 2 
is given by 

|| Bn—1|| 2 = (1 + 3 *-)?*--+*. (III.2) 

Not every sequence, however, in VV 2 has an (n — l)-comb. Finding the general formula 
for the number of sequences with an (n — l)-comb has been elusive. In the follow¬ 
ing two sections we show that the number of sequences of length 2 n possessing an 
(n — l)-comb is much smaller than the number of sequences with an n-comb. 

C. THE RUN STRUCTURE OF SEQUENCES POS¬ 
SESSING (n-l)-COMBS 

It is well-known that the number of de Bruijn cycles of length 2” is 2 2 "~' -n . 
Therefore, of all of the double Eulerian cycles along B n - 2 , there are 2 2 " -I-n that are 
defined by de Bruijn cycles. In this section we show that, statistically, we should 
expect the number of 2 R -long sequences with an (n — l)-comb to be only 2 2 " J-n+2 . 

A run of ZEROs in a binary sequence is defined to be a subsequence of con¬ 
secutive 0’s that is preceeded and followed by a 1. The length of a run of ZEROs 
is the number of consecutive 0’s in the subsequence. A run of ONEs is similary de- 
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fined. A binary sequence can be interpreted as a sequence of integers representing the 
lengths of the alternating runs of ZEROs and ONEs in the sequence. For example, 
the sequence: 

S = 00001010110111010011001000011111 

corresponds to the run sequence R — (4111121311222145). The sequence, R, consists 
of a subsequence Z$ = (41111224) of the run lengths of runs of ZEROs interleaved (or 
perfectly shuffled [Ref. 21 ]) with a subsequence O 5 = (11231215) of the run lengths 
of runs of ONEs. 

The multiset consisting of the lengths of the runs of ZEROs and ONEs in a 
de Bruijn cycle is completely determined, (See Chapter V or [Ref. 6 ]). In fact, a 
run sequence that defines a cycle in W-, in B n can also be determined. For example, 
when 5 is a sequence that defines a double Eulerian cycle along B n , the multiset of 
the lengths of the runs of ZEROs or ONEs in S consists of one of only two possible 
multisets, respectively. 

Theorem III.4 Let S be a sequence defining a double Eulerian cycle along 

B n . Then S satisfies one of the following 2 criteria: 

1. S has 2 n ~ k runs of ZEROs of length k for 1 < k < n and a single run of 
ZEROs of length n + 2, or 

2. S has 2 n ~ k runs of ZEROs of length k for 1 < k < n — 1 and two runs of 
ZEROs of length n + 1. 

The same distribution holds for the runs of ONEs. 







Proof: Since each (n + l)-tuple occurs exactly twice in 5, the longest runs of 
ZEROs must consist of either: 

1. Exactly 1 run of ZEROs of length n -f 2, or 

2. Exactly 2 runs of ZEROs of length n + 1. 

Case 1: The run of ZEROs of length n -f 2 must be preceded and followed by 
a 1, or the (n + l)-tuple (00 • • • 00) would appear at least three times in S. The 
(n -f l)-tuple consisting of a 1 followed by the n-tuple (00 .. .0) also occurs 
exactly twice in the sequence. One occurrence, however, is already accounted 
for by the run of ZEROs of length n + 2. Thus, there is an additional run 
of ZEROs of length n that provides the second (n -f l)-tuple consisting of a 
1 followed by the n-tuple 00 ... 0. Thus, there is no run of ZEROs of length 
n + 1. To find the number of runs of ZEROs of length k, for 1 < k < n — 1, we 
consider all n + 1 consecutive bits of the sequence that begin with a 1 followed 
by the fc-tuple 00 ... 0 and then a 1. Each such run can be made to correspond 
to an arbitrary (n + l)-tuple of the form 

1 QO... 0. 1 xx ... i. , 

k n—k—l 

where the x's are chosen as arbitrary bits. Since we are free to choose each of 
the remaining n — k—l bits, there are x 2 runs of ZEROs of length k 

for 1 < k < n — 1. With the single run of ZEROs of length n and the single 
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n 

1 

2 

3 

4 


Eligible Run Distributions for B n 

{3,1} 

{ 2 , 2 } 

{4,2,1,!} 

{3,3,1,1} 

{5,3,2,2,1,1,1,1} 

{4,4,2,2,1,1,1,1} 

{ 6 , 4 , 3 , 3 , 2 , 2 , 2 , 2 , 1 , 1 , 1 , 1 , 1 , 1 , 1 , 

{5,5,3,3,2,2,2,2,1,1,1,1,1,1,1, 


1 } 

1 } 


Table III. Distribution of runs for double Eulerian cycles along B n 


run of ZEROs of length n + 2 the result follows. The same argument holds for 
runs of ONEs. 

Case 2: The two runs of ZEROs of length n + 1 must each be preceded and 
followed by a 1, or the (n + l)-tuple, 00- 00, would appear at least three 
times in S. The (n + l)-tuple consisting of a 1 followed by the n-tuple 00 ... 0 
occurs exactly twice in the sequence. These, however, are already accounted 
for by the two runs of ZEROs of length n +1. Thus, there is no run of ZEROs 
of length n. In a like manner as above, there are 2 n-fc_1 x 2 runs of ZEROs of 
length fc, for 1 < k < n — 1. With the two runs of ZEROs of length n + 1 the 
result follows. The same argument holds for the run distribution of ONEs. ■ 


Every double Eulerian cycle around B n can be described by a binary sequence 
whose run sequence, /?, consists of a subsequence Z n interleaved with a subsequence 
O n - The subsequences Z n and O n are respectively a permutation of one of the two 
multisets in Theorem III.4. To determine how many sequences have (n — l)-combs, 
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it is useful to partition ail of the double Eulerian cycles into one of the following 3 
sets: V( n , 2 ), P(n,i), and P (n) defined as: 

1. The set, V( n , 2 ), consists of sequences where both the ZERO and ONE run 
sequences are permutations of the multiset that includes the element n + 2. 

2. The set, V(n,i), consists of sequences where both the ZERO and ONE run 
sequences are permutations of the multiset that includes the element n + 1. 

3. The set, V( n ), consists of sequences where the ZERO and ONE run sequences 
are permutations of different multisets. 

The sizes of each of the 3 sets, V( n , 2 ), V( n , i), and V ( n ), are determined in a 
straightforward manner. Equation III.2 provides the total number of double Eulerian 
cycles, ||||a* where ||B n || 2 = |'P ( „, 2 )| + j'P(„,i)| + |^(»)|* Let B n denote the Eulerian 
digraph constructed from the Good - de Bruijn digraph by removing the two arcs 
(loops), (0) and (1). One can see that l^n.i)! is equivalent to the number of sequences 

A 

representing double Eulerian cycles along B n , i.e., |‘P( n ,i)| = ll^nlb- From Equation 
A.5 in Appendix A we find 

|P (n ,i)| = i(2( 2 "- n - 1 )+2-{ 2 " +1 - 2 )2( 2 ”-"- I )l2( 2n - 2 )2 2 ) 

_ 2 ( 2 "- n - 2 ) -|- 2( 2n-n - 2 )3( 2n - 2 ) (III.3) 

= 2< 2n - n - 2 >(l + 3( 2 "- 2 )). 

Now let B n and B n denote Eulerian digraphs constructed from the 
Good - de Bruijn digraph by removing just the loop, (0) and (1), respectively. We 
define Q to be the set of sequences representing double Eulerian cycles along B n . By 
inserting a 0 into the longest run of ZEROs of a sequence Q € Q, a sequence is created 
that contains a run distribution of ZEROs that includes the element (n + 1) tw'ce 
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and a run distribution of ONEs from either of the two eligible multisets. In a similar 
manner, each sequence representing double Eulerian cycles along B n is equivalent to a 
sequence with a run distribution of ONEs that includes the element (n + 2) and a run 
distribution of ZEROs from either of the two eligible multisets. We define V, to be set 
of sequences representing the double Eulerian cycles along £„. The set Q U 71 = U 
is equivalent to the set of double Eulerian cycles that includes all the elements of 
'P(n)- In addition to the elements of V( n ), the set U also includes the sequences that 
have ZERO and ONE run sequences appearing as permutations of the multiset that 
includes the element n + 1, i.e., 'P(n.i) U V( n ) = U. It follows immediately that 

|P| = (lia.ll* - ia„,„|) x 2, 

since ||£„1| 2 = ||f?„|| 2 . By again applying Equation A.5 in Appendix A, we find that 

\V (n) \ = + 2-( 2 " +l - 1 >2< 2 "~ n - 1 >12( 2 "- 1 >2 1 ) - 2< 2 "-"- 2 >(l -l- 3< 2 "- 2 >)) x 2 

= (2 (2 "- n - 2) + 2 (2 "- n - 2 )3< 2 "- 1 > - 2 (2 "- n - 2 >(l + 3< 2 ”- 2 ))) x 2 
= 2< 2n - n >3< 2n - 2 ). 

(HI-4) 

Consequently, it follows that 

l*W)l = l|s n || 2 - l*V)l - \v { n)\ 

(III.5) 

_ 2( 2n -")3( 2 "- 2 ). 

Table IV provides the values for ||£ n || 2 , |P(„, 2 )|, |P(«,i)|, and \V( n) \, for 1 < n < 4. 


38 









n 

IIAIIt 

\P(n.2)\ 

l^n.1,1 

.. PW... 

1 

5 

2 

1 

2 

2 

82 

36 

10 

36 

3 

52,496 

23,328 

5,840 

23,328 

4 

44,079,843,328 

19,591,041,024 

4,897,761,280 

19,591,041,024 


Table IV. Double Eulerian cycles along B n , by category 


D. RANDOMNESS OF THE DOUBLE EULERIAN WALKS 

In this section we show that the number of classical de Bruijn cycles in the 
set P( n ,2) coincides with the expected number of de Bruijn cycles that would exist 
if the sequences in V( n , 2 ) had a particular randomness property. (Recall that of the 
2 (2 "~ n )3^ 2n-2 ) sequences in V(n,i)i exactly 2 2 " +l_n_2 are de Bruijn, since the sequence 
length corresponds to a Hamiltonian path through B n+ 2.) 

A Randomness Property : From any vertex, each departing arc is equally likely 
to be traversed next in a walk along B n defined by the 2 n+2 -long sequence, S n+2 , 
where S n +2 € V( n , 2 )- 

Each vertex in B n is entered 4 times along the walk defined by the sequence 
S„+ 2 . Let the arcs in B„ be labeled with 0’s and l’s as described in Chapter II. 
Suppose vertex v is entered from the arc labeled a\ for the first time. Let X\ be a 
binary indicator variable where X\ = 1 if and only if the next arc traversed from v 
is labeled I. The indicator variable Xi = 0 if and only if the next arc traversed from 
v is labeled 0. Similarly, when a vertex v is entered from the arc labeled c*i for the 
second time, let X 2 be a binary indicator variable where X 2 = 1 if and only if the 
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next arc traversed from v is labeled 1. The indicator variable X 2 = 0 if and only if 
the next arc traversed from v is labeled 0. 

The joint probability function for the two discrete random variables X\ and 
X 2 is given by 

Px x ,x j(xi,x 2 ) = P{X x = x u X 2 = x 2 ), for x u x 2 € {0,1}. 

If Xi ^ X 2 for each vertex in the sequence S n + 2 is de Bruijn of length 2 n+2 , since 
each (n + 2)-tuple is distinct. The probability that X\ ^ X 2 for each vertex in B n for 
the walk S n+2 can be computed quite easily. 

Let t; be a vertex that is not incident with a loop in B„. Since S n + 2 defines a 
walk where each outgoing arc from v is equally likely to be visited next and each arc 
in B n is visited exactly two times, it follows that 

P(Xi = 1|X, = 0) = J, 

where the bar within the parentheses indicates conditional probability. Furthermore, 
if v is a vertex incident with a loop then, 

P(X 2 = l\Xx = 0) = 1, 

since each arc in B n must be visited twice and we must visit the loop at this time or 

/ \ 2 n —2 

miss it entirely. Therefore, the probability that 5„+2 is de Bruijn is p = (f j 

If the sequences in V( n , 2 ) have the property that they each define a walk by 
which subsequent arcs from each vertex are equally likely to be traversed, we would 
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expect 

( 9 \ 2"-2 / 0 \ 2 n —2 
|) = x (|) = 

sequences in V( n , 2 ) to possess the de Bruijn property. Since there are exactly 2 2 " +l-n_2 
de Bruijn cycles in P( n , 2 ), this expectation is realized. In the next section, we discuss 
the implication of this randomness property with respect to the expected number of 
complete cycles associated with an (n — l)-comb. 

E. THE NUMBER OF COMPLETE CYCLES 

Counting the exact number of sequences associated with an (n — Ar)-comb, 
1 < k < 2 n_1 — 1, remains a difficult problem. We can, however, estimate the size 
of the set of complete cycles associated with a particular comb. One could postulate 
that there are equal numbers of sequences for each possible comb, especially since the 
previous section suggests that the successive bits (0 and 1) are equally likely among 
the set of eligible sequences. It has been found by a computer search that sequences 
with n-combs tend to be much more common than sequences with (n — fc)-combs, for 
k > 1. This result is supported by the same probabilistic argument developed in the 
previous section. 

For simplicity, we again utilize the sequence S n + 2 defined in Section D. Here we 

n—1 

find the probability that 5 n+2 supports an (n — l)-comb of the type H " ' jj *L Suppose 
that vertex v is entered from the arc labeled Qi for the first time. Then, the indicator 
variables Xi and are defined as before, except that now we are concerned with the 
second arc traversed from v. Therefore, we again let Xi be a binary indicator variable 
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where X\ = 1 if and only if the second arc traversed from v is labeled 1. The indicator 
variable X\ = 0 if and only if the second arc traversed from v is labeled 0. Similarly, 
when vertex v is entered from the arc labeled Qi for the second time, let X 2 be a 
binary indicator variable where X 2 = 1 if and only if the second arc traversed from 
v is labeled 1. The indicator variable X 2 = 0 if and only if the second arc traversed 
from v is labeled 0. 

In contrast to the situation described in Section D, the inequality of Xi and X 2 
for all vertices in B n is not a sufficient condition for S n+2 to support an (n — l)-comb. 
It must also be the case that Y\ ^ Y 2 , where Y\ and Y 2 are similarly defined on the 
other arc, a 2 , entering the vertex v. Therefore, when vertex t; is entered from the arc 
labeled a 2 for the first time, let Yi be a binary indicator variable where Yi = 1 if and 
only if the second arc traversed from v is labeled 1. The indicator variable Yi = 0 
if and only if the second arc traversed from v is labeled 0. Similarly, when vertex v 
is entered from the arc labeled a 2 for the second time, let Y 2 be a binary indicator 
variable where Y 2 = 1 if and only if the second arc traversed from v is labeled 1 . The 
indicator variable Y 2 = 0 if and only if the second arc traversed from v is labeled 0. 

The joint probability function for the 4 discrete random variables X\, X 2 , Yi, 
and Y 2 is given by 

Px,,x 2 ,K,,y 3 (a;i,22,yi,y2) = P(Xi = = x 2 , Yi = y u Y 2 = y 2 ), 

for Xi,x 2 ,yi,y 2 € {0,1}. If X\ ^ X 2 and Yi ^ >2 for all the vertices in the 

n—1 

sequence has a comb specified by j ‘" The probability that X\ ^ X 2 for each 
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vertex in B n not incident with a loop is §. Given that X\ ^ X 2 , the probability that 
Vi ^ Y 2 is Therefore the probability of X\ ^ X 2 and Y\ ^ Y 2 is p « 5 . 
Furthermore, if v is a vertex incident with a loop then, 


P(X 2 = 1 )*! = 0) = 1, 


since each arc must be visited twice. 

n—1 

II 11*1 comb is approximately ( 5 ) 


Therefore, the probability that 5 „+2 
We would expect to find that 


has a 


/ In 2 "- 2 / 1\ 2 "- 2 

x ( 5 ) = x ( 5 ) = 2 2 "“" 


of the sequences in V( n , 2 ) ^ ave these combs. For n = 3, of the 23,328 sequences in the 
set V, there are exactly 2 5 combs of the type M*L supporting our claim. In general, the 
experimental results support the probabilistic predictions. We have found that the 
sizes of the set of 2 n -long sequences with (n — l)-combs, of all types, is substantially 
smaller than 2 2 "~ ,-n , the number of de Bruijn sequences of length 2 n . 

In the following chapters we develop the necessary theoretical concepts on 
double Eulerian cycles to analyze the Good - de Bruijn digraphs and to gain a better 
understanding of complete cycles. A complete combinatorial explanation for the 
number of sequences satisfying a particular comb is an intriguing goal that seems 
very hard to achieve at present. 
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IV. A MEASURE ON A GRAPH 

“Would you tell me, please, which way I ought to go from here?” 
u That depends a good deal on where you want to get to, ” said the cat. 

“ / don’t much care where ...” said Alice 
“Then it doesn't matter which way you go, ” said the cat. 

Alice’s adventures in Wonderland, Chapter 7 

A. INTRODUCTION 

In this chapter we develop the theoretical concepts regarding double Eulerian 
cycles, i.e., walks on a connected graph or weak digraph that visit every edge exactly 
twice. A measure can be given to the visitation pattern of a double Eulerian cycle. 

Every de Bruijn cycle (and some nonclassical de Bruijn cycles) of length 2" 
defines a walk that visits every vertex on B n , traverses each arc of B n -i, and passes 
through each arc on B n -2 exactly twice. From these latter walks emerges a measure 
of complexity of the sequence in question and a notion of the value for the underlying 
graph. 

B. THE VALUE OF A GRAPH 

We define W 2 to be the set of walks that traverses each edge (arc) in the graph 
(digraph) exactly twice. The set Wj is called the set of double Eulerian cycles. We 
define a measure on a walk in W 2 as the sum, over ail edges of the graph, of the 
positive difference of the visitation times on each edge. The value of a graph G is 
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Figure 11 . Double Eulerian cycles with visitation times 


then defined to be the minimum measure over all double Eulerian walks on G. In 
addition, every connected graph always has a value. 

In Figure 11, two different double Eulerian walks are presented. The numbers 
in parentheses are the visitation times. Walk A = (v t ,v 2 , v 3 , v 4l v s , v 2 , v 3 , v 4 , v 5 , v 2 , Vi) 
has a measure of 25. Walk B = (v 2 ,t 7 i,t> 2 ,t> 5 ,t> 4 ,t> 5 , v 2 ,v 3 ,v 4 ,v 3 , v 2 ) has a measure of 
9, which can be shown to be the value of the graph. 


Theorem IV. 1 Let G be a connected graph. Then G has at least one double 
Eulerian walk. 

Proof: Let H be the graph generated by duplicating each edge in G. The 
degree of each vertex in H is then even. Therefore, Theorem II. 1 implies that 
H has a closed Eulerian trail and G must have a double Eulerian cycle. As an 
immediate corollary, a double Eulerian cycle in G yields an Eulerian walk in 
H. ■ 
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Corollary IV. 2 Every double Eulerian cycle in a grapi . G is a closed walk. 

Proof: Let H be the graph determined by duplicating each edge in G. From 
Theorem IV. 1, a walk in H is Eulerian if and only if the walk is double 
Eulerian in G. The degree of each vertex in H can be counted by following 
an Eulerian walk W\ in H. Each occurrence of a vertex along W\ adds 2 to 
the degree of an interior vertex in W 1 and 1 to the degree of the initial and 
terminal vertices (recall that a walk is a sequence of vertices and edges). Since 
the degree of each vertex in H is even, the initial and terminal vertices of the 
walk must be the same vertex. Therefore, every double Eulerian cycle in G is 
closed. ■ 

Since a double Eulerian cycle is closed, the edge sequence representing a double 
Eulerian cycle can be viewed as a cycle. Alternatively, we can also define a measure on 
a walk in W 2 as the sum over all edges of the distance along the cycle between the two 
occurrences of each edge. From this perspective, the cycle distance between identical 
edges in W 2 remains constant for each cyclic shift of the edge sequence representing 
the double Eulerian cycle. In Figure 12, A = (e 5 ,ei,e 2 ,e 2 ,e 3 ,e 3 ,ei,e s ,e 4 ,e 4 ) is a 
double Eulerian walk. As before, visitation times are in parentheses. The distance 
between the first appearance at each edge ei, e 2 , e 3 , e 4 , es and the second appearance 
in walk A are 5,1,1,1,3, respectively. Walk B = (ej,e 2 ,e 2 ,e 3 ,e 3 ,ei, e 5 , e 4 ,e 4 ,es) is the 
sequence generated by cyclically shifting walk A by one. The distances between the 
two occurrences of each edge in walk B are unchanged. 
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t>4 v s v 4 v s 

Figure 12. Double Eulerian cycles with visitation times (cycled) 

In a graph G with size q, the sequence of edges /j, /a,..., /a 9 in a double Eule¬ 
rian cycle completely determines the measure of the walk. More precisely, the measure 
of a walk is ultimately determined by the number of edges previously traversed ex¬ 
actly once (called single-edges or a-edges) along the double Eulerian cycle. When 
an edge e has been traversed twice (or not at all) during the walk, subsequent edges 
along the walk can neither increase nor decrease the difference of the two visitation 
times for that edge. When an edge e', however, has been traversed exactly once during 
the walk, each subsequent edge traversed along the walk increases the difference of the 
visitation time for e' by one. Thus, it is the number of current <7-edges at each step 
along the walk that determines the measure of the double Eulerian cycle. Finally, 
as stated before, the aggregate sum of the (7-edges at each vertex along the double 
Eulerian cycle, is the measure of the walk. From this perspective, the measure of a 
double Eulerian cycle can be calculated as follows: 

1. Transform the sequence of edges in a double Eulerian cycle into a 

sequence, {a n }^Li °f l’s and -l'sas follows: 


4 ' 







Edge {x,y} = 


{ 1 if this is the first occurrence of the edge {x,y} 
on the double Eulerian cycle 
—1 otherwise. 


29 

2. Let {£ n }n=i t> e the sequence of partial sums of the series ]jP a„. 

n=l 

2q 

3. The measure of the walk is t y 

k= 1 


For example, consider walk A in Figure 11. We find the following: 


1. KU° =1 = {1,1,1,1,1, -1, -1, -i, -1, -l}. 

2 . {£«});=! = { 1 , 2 , 3 , 4 , 5 , 4 , 3 , 2 , 1 , 0 }. 

29 

3. The measure of walk A is ]jT] tj = 25. 

fc=i 

It is interesting to note that set of possible sequences {a n }nLi °f l’s and -l’s 
for a graph G defined in this way is a subset of all of the (north/east) routes between 
opposite corners on a q x q lattice that are on or below the diagonal, where q is the 
size of G. It is well-known [Ref. 22] that the total number of subdiagonal routes is 

where C n is the nth Catalan number. 

In the remainder of this section we show that the value of a graph G can be 
determined as a function of the edge-status, es v , of a vertex v where v € EM(G) and 
q , the size of G. First we develop some results relating the concepts of status and 
edge-status. 
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Theorem IV.3 Let T = (]/,£) be a tree. Then es v = s v — q where es v is the 
edge-status of v, s v is the status of v, and q is the size of T. 

Proof: If a tree T has one edge, it follows immediately that es v = s v — q. 
Assume T has more than one edge. Let v be a root of T that gives T a height 
of h. Assume e\ and e 2 are two edges such that d e (v,e i) = d e (v,e 2 ) = k and 
where e\ and e 2 are incident with the same vertex U3 with d(v, v 3 ) = k+1. Then 
(u,..., e\, U3) and (v, ..., e 2 , v 3 ) are two different walks from v to the vertex 
v 3 . This infers a cycle in T. Therefore, each edge e € E, where d e (v, e) = i, 
0 <i < h — lis incident with a unique vertex n 6 V — v where d(v, n) = * + 1. 
Since |£| = |V| — 1, it follows that 

es v = '%2d e {v,e)= £ d(v,e) - 1 = s v - q. 
e€E ngV—v 


The following corollary follows immediately from Theorem IV.3. 

Corollary IV.4 LetT be a tree, with edge-median EM(T) and median M(T). 
Then EM(T) = M(T). 

Unfortunately, if a graph G is not a tree, it is not necessarily true that 
EM(G) = A/(C?). Graph G in Figure 13 has EM(G') — {t>$} and M(G) = {vs}- 
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G: 

Figure 13. Example for which EM(G) ^ M{G ) 

The next result provides a lower bound for the measure of a walk in W 2 , where 
v € V on a connected graph G = (V, E). 

Lemma IV.5 Let G = (V, E) he a connected graph of size q. Let v € V, with 
edge-status es v . If m is the minimum measure of a double Eulerian cycle W 2v 
beginning at v, then m > 2 (es„) + q. 

Proof: Let v be the initial vertex of a double Eulerian cycle in G. Let 
maxd e (u, e) = n. Since a double Eulerian cycle is closed, all of the mem- 
bers of the set of edges of edge-distance j, 1 < j < n, from v must be visited 
twice before all edges of edge-distance j — 1 are visited twice. It follows that 
there are at least j <r-edges when any edge e ; - in the set of edges of distance 
j from v is visited for the first time in a double Eulerian walk. Furthermore, 
there are at least j + 1 <r-edges when ej is traversed the second time in the 
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walk (since e : is now included in the set of <7-edges). Therefore, each edge e } 
at edge-distance j from v increases the measure of a walk by at least 2j -f 1. It 
follows that if m is the minimum measure of a double Eulerian walk beginning 
at v , then 

m > 'V (2 (d e (v, e)) + 1) = 2{es v ) + q. 

e£E 


We now describe an algorithm (Algorithm A) to construct a double Eulerian 
cycle of minimum measure on a connected graph. The algorithm includes a subroutine 
Cyclel that is recursively called throughout the algorithm. Informally, Algorithm A 
is very similar to the depth-first search algorithm [Ref. 23]. The algorithm proceeds 
from the initial vertex v in a forward direction (adding new edges) for as long as this 
is possible. When it is no longer possible to advance (add a new edge), the algorithm 
backtracks to the first vertex from which it is then possible to go forward revisiting 
edges as it goes. The algorithm proceeds until each edge is visited exactly two times. 
Each new edge added by the algorithm has the following properties: 

1. The new edge is not in the current walk, 

2. The new edge is incident with the last visited vertex in the walk, and 

3. The edge-distance from v to the new edge is exactly one greater than the 
edge-distance from v to the last visited cr-edge in the walk. 
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It follows from the above that when no new edge can be added to the walk, the next 
edge traversed is the last visited <r-edge in the walk (i.e., the <7-edge of greatest edge- 
distance from the initial vertex v). This is in essence the backtracking aspect of the 
algorithm. 

ALGORITHM A [Constructs a double Eulerian cycle on a connected graph 
G = (V, E) beginning at vertex v € V] 

Input: A graph G = (V, E) of size q and a vertex u as global parameters. 

Output: Array P, an ordered list of edges constituting a double Eulerian cycle 
through G beginning at the given vertex v. 

Parameters N = [n l5 ... ,n 2 ,+i], P = [pi, • • • ,P 2 9 ], and H = [h u ... ,h q ], are global 
arrays of vertices, edges, and edges of size 2q + 1, 2 q, and q, respectively. 

(Note: The parameter d is an integer that denotes the edge-distance of the last visited 
o-edge from vertex v in an ongoing walk. When there are no (7-edges in the walk, d is 
set to -1. The parameter n* indicates the current vertex in the walk. The parameters 
i and j are integer counters indicating the number of edges currently traversed and 
the number of current <r-edges along the walk, respectively.) 
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t>3 *1 v? e 3 V\ 



V 4 e 6 l>s 


Figure 14. A graph to illustrate Algorithm A 


Algorithm A 


Initially set j := 1; i := 1; d —1; ni := 1 /; n * := empty; 2 < k < 2q + 1; 
pi := empty. 1 < / < 2q; and h m := empty; 1 < m < <7. 


Cyclel(lV, P, H,j, i, d); end. 

Subroutine Cyclel (N, P, H,j, t, d) 

if d e (n,, e) = 0 and <Z e (ni, e) = d + 1 and d e (i/, e) = d( v, n,) and e 0 P, for e € E 
then n l+1 := x where e = (n,, x) 

Pi e 
hj := e 

d :=d +1 
i := i + 1 

i == j + 1 

Cyclel(7V,P,/f,j,i,d) 
else pi := hj 

n »+i := «i_i 

d :=d- 1 
J := i + 1 

j := j - 1 

if i = 2g 
then end. 

else Cycle 1 (/V, P, H, j , i, d) 


To illustrate Algorithm A, consider the graph G in Figure 14. Let v 4 be the 
initial vertex u. Both e x and e@ satisfy the criteria for being a new edge to add 
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to the walk. Without loss of generality select t\. Subsequently, both e 2 and e 4 
satisfy the criteria for a new edge to enter the walk. Let e 4 be selected. Continu¬ 
ing, the double Eulerian cycle constructed by Algorithm A is the sequence of edges 
P = [e 1 ,e 4 ,e 4 ,e 2 ,e 3 , e$, e 2 ,ei,e 6 , es, es, eg]. Alternatively, if e 2 had been selected 
rather than e 4 , the sequence of edges P' = [ei,e 2 , 63 , e 3 , e 2 ,e 4 ,e 4 , ei,ee, e 5 ,e 5 ,e 6 ] is 
generated. Both F and F' have the same measure. 

The algorithm adds a new edge to the walk when the three conditions stated 
above are satisfied. Each new edge has an edge-distance from u of exactly one greater 
than the last <r-edge visited. During the walk if a new edge cannot be trav ed, the 
algorithm backtracks through the last visited cr-edge until the criteria for adding a 
new edge is met. When a <x-edge is added through backtracking, the <r-edge to be 
traversed is the one that has the greatest edge-distance from v of all <r-edges currently 
in the walk. This is clearly the last visited <r-edge. The following lemma shows that 
ultimately each edge in the graph is traversed exactly twice. 

Lemma IV.6 Let G = (V, E) be a connected graph. Algorithm A constructs 
a double Eulerian cycle on G beginning at v € V. 

Proof: We prove, by induction on the edge-distance from the initial vertex v, 
that each edge is traversed exactly twice. 

Basis Step: The edge-distance between the first edge e on the walk generated 
by Algorithm A and the initial vertex v is 0. Since there are only a finite 
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number of edges, the edge-distance from t; to each edge is finite. Therefore, 
the walk must eventually backtrack through each of the a-edges of the ongoing 
walk generated by the algorithm. When there is no edge of distance 1 to be 
added after edge e, tnen edge e will be revisited by backtracking. Hence e is 
traversed exactly 2 times. In a similar fashion when the walk continues along 
another edge e' of edge-distance 0 from r, finiteness and backtracking insure 
that e' is traversed exactly twice. 

Inductive Step: Assume that the algorithm traverses each edge e where 
d e (v,e) = k. Let e* + i be an edge of edge-distance k - j- 1 from v. There 
exists an edge e* of edge-distance k incident with edge et+i, else e*+i is not 
of distance k + 1 from v. Let vertex Vk be incident to both e* and e*+i. The 
distance from v to Vk is k, else either the edge-distance to ek is not k or the 
edge-distance to e* +1 is not k + 1. When e* is traversed for the first time, the 
edge e/t+i satisfies the criteria to enter the walk. Either e*+i or another edge 
e fc+i °f edge distance k + 1 will be the next edge added to the walk. If e*+i 
is not the next edge in the walk, then because there axe only a finite number 
of edges and by backtracking, the walk will eventuadly return to u*. The edge 
ek+i still satisfies the criteria to enter the walk. Since there are only a finite 
number of edges incident to Vk, e* +1 must ultimately enter the walk and be 
traversed exactly twice through backtracking. ■ 
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We made the point that when an edge of distance A: from v is added to a walk 
in Algorithm A , it is added when an edge of distance A: — 1 is the last cr-edge in the 
walk. Now we make a further statement about the set of cr-edges when an edge of 


distance k is added to a walk. 

Lemma IV.7 When an edge e k of edge-distance k from the initial vertex of 
the walk is traversed for the first time using Algorithm A, there are exactly k 
edges that have been previously traversed exactly once. 

Proof: We prove, by induction on the edge-distance, that when an edge e k of 
edge-distance k from the initial vertex v of the walk is traversed for the first 
time using Algorithm A, there are exactly k a-edges on the walk. 

Basis Step: When an edge e of edge-distance 0 is traversed for the first time, 
we are at the vertex v and there are no cr-edges remaining in the walk. 

Inductive Step: Assume that when any edge of edge-distance k is traversed 
using Algorithm A for the first time there are exactly k cr-edges in the ongoing 
walk. Let d e (v, e k +i) = k- f 1 . The edge e*+i is initially traversed only when the 
last visited cr-edge e in the walk has an edge-distance of k from v. Therefore, 
by the inductive hypothesis, there are then exactly k + 1 cr-edges after e k +\ 
initially enters the walk. ■ 
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Now we can relate the measure of double Eulerian cycles in a graph G to the 
edge-status of G. 

Theorem IV.8 Let G = ( V , E) be a connected graph of size q. Let v € V, 
with edge-status es v . If m is the minimum measure of a double Eulerian cycle 
W v beginning at v, then m = 2(es„) + q. 

Proof: Algorithm A constructs a double Eulerian cycle through a graph. 
When an edge e, of edge-distance j from the initial vertex in the walk is 
traversed for the first time using Algorithm A, Lemma IV.7 states there are 
exactly j (T-edges. This then adds j to the measure of the walk (1 for each of 
the j <r-edges). When an edge e, of edge-distance j is traversed by Algorithm 
A for the second time, there are exactly j -j-1 <r-edges (the edge e : now belongs 
to the set of <r-edges). This adds j + 1 to the measure. Therefore, the measure 
of a double Eulerian cycle constructed by Algorithm A is 

£ (d e (i>, e) + d e (v, e) A 1 ) = ( 2 <f e (v, e) + 1 ) = 2 (es v ) + q. 

e€£ e€E 

It follows immediately from Lemma IV.5 that the minimum measure of a 
double Eulerian cycle W 2v beginning at vertex v is 2 (es v ) + 9 . ■ 

Corollary IV.9 Let G be a connected graph. Let W 2v be a double Eulerian 
cycle on G that produces the value ofG. Then v € EM(G). 
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G: 

v A <1 

Figure 15. To illustrate the value of a graph 
Proof: Follows directly from Theorem IV.8 and the definition of EM(G). ■ 

Corollary IV. 10 Let G = {V,E) be a connected graph of size q with a value 
of m. Let v € EM(G), with edge-status es v . Then m = 2(es„) + q. 

Proof: Follows immediately from Theorem IV.8. ■ 

Corollary IV.ll Let T = (V, E) be a tree of size q. Let v € V, with edge- 
status es v . The measure m of a double Eulerian cycle W v in T ism = 2(s v )—q. 

Proof: Follows directly from Theorem i - • and Theorem IV.8. ■ 

We have shown that the value of a connected graph G is determined by the 
edge-status, es v of a vertex v € EM(G) and by q the number of edges in G. Algo¬ 
rithms to find the median of a graph can be found in [Ref. 24] and [Ref. 25]. These 
algorithms can be modified in a straightforward manner to determine the distances 
from vertices to edges and the edge-median of a graph. 
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We discussed earlier that the measure of a walk W € W 2 can be defined using 
the distance between the two occurrences of each edge on the cycle W. If each cyclic 
shift of a cycle is considered equivalent, this definition assigns the same measure to 
equivalent cycles. In Figure 15, the double Eulerian cycle, 

W = (cj, es, e 4 , e 3 , e 3 , cj, e 2 , c 4 , e®, e®, es, ®i), 

and each cyclic shift of W has the measure 12 when using the distance between 
identical edges to calculate the measure. 

C. THE VALUE OF A DIRECTED GRAPH 

The following well-known theorem is stated without proof. See, e.g., [Ref. 19]. 

Theorem IV.12 A weak digraph is Eulerian if and only if for every vertex 
its in-degree and out-degree are equal. 

From this theorem, the next result follows readily. 

Lemma IV.13 Every double Eulerian walk in a digraph D is closed. 

Proof: Let P = (u = u 0 ,ui,... , tt 2 * = v) be a double Eulerian cycle in the 
digraph D = (V, A) where |A| — k. If u ^ v, then v appears an even number 
of times on P, since each incoming arc on v is used exactly twice. Since v is 
the terminal vertex of P, however, v has been exited an odd number of times. 
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This is a contradiction, since P is double Eulerian. Thus ti = v, and P is a 
closed walk. As an immediate corollary, in(u) = out(u), and D is Eulerian. ■ 

The following theorem shows that the Eulerian and double Eulerian property of a 
digraph are equivalent. 

Theorem IV. 14 A weak digraph D is double Eulerian if and only if D is 
Eulerian. 

Proof: Suppose that a digraph D is double Eulerian. Since a double Eulerian 
cycle provides a path between any pair of vertices, the digraph must be strong 
(and hence also weak). By Lemma IV. 13, the in-degree and out-degree of each 
vertex is even. Therefore, D is Eulerian. Conversely, suppose D is Eulerian. 
Traversing an Eulerian trail exactly twice yields a double Eulerian cycle. ■ 

The Eulerian graph can be partitioned into edge disjoint cycles. 

Theorem IV.15 A digraph D has an edge-factor if and only if it is Eulerian. 

Proof: Let D be an Eulerian digraph. A closed Eulerian trail on D is an edge- 
factor. Conversely, let F be an edge-factor of the digraph D. The in-degree 
and out-degree of any vertex in each cycle of F are equal. Since the arcs on 
the cycles in F partition the set of arcs in D, the in-degree and out-degree of 
each vertex in D must be equal. Therefore, D is Eulerian. ■ 
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By Theorem IV. 15 every double Eulerian cycle W 2v along a digraph D defines 
a specific edge-factor F. Given F we can obtain a lower bound for the measure of 
W 2v . 

Theorem IV.16 Let F = {C\, C 2 ,..., Cj} be an edge-factor of the digraph 
D. The measure, m, of any double Eulerian cycle W 2w in D satisfies 

m > £ |C.| ! + 

1=1 

Proof: We prove this by using strong induction on k, where k is the largest 
number of cycles in any edge-factor of D. 

Basis Step: Let I? be a digraph such that for any edge-factor F of D, |F| = 1 
(i.e., no edge-factor of D has more than a single cycle). Therefore the set E 
of cycles in each edge-factor of D is the set of Eulerian cycles in D. Then 
F — {R} where R € E. Every double Eulerian cycle in W 2 , along D is exactly 
two traversals of such a cycle R. Therefore, the measure of any walk in W 2 „ is 

ceF 

(Note: p v {F) = 0 since all vertices v are on R and C = R is the only cycle in 

n 

Inductive Step: Let F = {Ci, C 2 ,..., Cj} be an edge-factor of the digraph D. 
Assume that the value m of any double Eulerian cycle W 2v in D satisfies 

m > £l Ci\ 2 + p v (F). 

i=i 
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Let D be a digraph satisfying max |F| = n, and let W 2v be a double Eulerian 
cycle on D. Since D is finite, then at some point along W 2v a vertex is repeated. 
This defines a cycle in D. Let C be the first cycle completed along W 2v . Let a 
be the k ih arc traversed on C. After a is initially traversed there are exactly 
k <r-arcs in C. Let a' be the j tk arc on C to be traversed for the second time 
along W 2v . After a' is traversed for the second time, there Me \C\ — j <r-arcs 
in C. Furthermore, each step along C increases the difference of the visitation 
time by one for each cr-arc not in C. Since each arc in C is traversed twice, C 
increases by 2 the difference of the visitation times for exactly one arc of each 
edge-distance i from v (by the definition of C), where 0 < i < d e (v,C) — 1. 
Therefore, the set of arcs in C taken together increases the measure of the 
walk W 2v by some 

\c\ \c\ 

> E * + E(l c l - J) + 2KW», C) = |C| 2 + 2| CK(®, C). 

k-\ j-\ 

Let D* = D — (C) v . After we remove C, the edge-factor for each component 
of D' has at most n — 1 cycles. Let F' = {C[, C 2 , • •., C'j) be an edge-factor 
of the digraph D By the inductive hypothesis the measure m of any double 
Eulerian cycle on any component G of U satisfies 

i=i 
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Therefore, if F = {Ci, Cj ,..., C/} is an edge-factor of the digraph D, then 
the measure m for any double Eulerian cycle W 2t , along D satisfies 


»=I 


We now describe an algorithm (Algorithm D) that constructs a double Eulerian 
cycle of minimum measure, beginning at a vertex v on an Eulerian digraph. Algorithm 
D adds cycles to create a double Eulerian cycle whose measure attains the lower bound 
of Theorem IV.16. This algorithm is similar to Algorithm A. The reader should make 
note of the similarities and differences between the roles played by the edges of a graph 
in Algorithm A and the cycles in the Eulerian digraph in Algorithm D. Informally 
speaking, Algorithm D moves forward , entering cycles of greater distance from a 
vertex v for as long as this is possible. When it is no longer possible to enter a new 
cycle at greater distance, the algorithm traverses each arc on the current cycle exactly 
twice and then backtracks to the previous cycle. The backtracking continues to the 
first cycle from which it is possible to go forward to enter new cycles. It is clear that a 
cycle C may be entered at several places along a given walk. The first edge traversed 
on C cannot be at distance d > d e {v,C). Algorithm D does not allow a cycle to be 
entered for the first time at any vertex other than a vertex of distance d e (v,C). 

Algorithm D includes a subroutine Cycle2 that is recursively called through¬ 
out the algorithm. Algorithm D constructs a double Eulerian cycle W 2v by succes- 



sively selecting cycles from an edge-factor to be traversed. Each new cycle C added 
has the following properties: 

1. There are d e (v,C) arcs in W v that have been traversed exactly once when C 
is entered. 

2. No arc on the cycle C has previously been traversed. 

3. If n is the last vertex encountered along the walk, then n € (C)„ and 
d(v,n) = d e (v, C). 

ALGORITHM D [Constructs a double Eulerian cycle on an Eulerian digraph be¬ 
ginning at vertex u] 

Input: An Eulerian digraph D = (V, A) of size q , a vertex v € V, and edge-factor 
C = {ci, ca,..., c/} as global parameters. 

Output: Array P, an ordered list of the arcs constituting a double Eulerian cycle 
through D beginning at the given vertex u. 

Parameters: N = [i»i,... ,n 2?+ i], P — [pi,... ,pa 9 ], and M = [mi,...,m/], are 
global arrays of vertices, arcs, and integers of size 2q + 1, 2 q, ana /, respectively. 
Parameters i, j, and t are integers. 

The following is a brief description of specific parameters: 

1. The parameter n, indicates the current vertex in the walk. 

2. The value of mi indicates the following about the arcs in the cycle c/: 

(a) If mi — 0 then no arcs on Ci are currently on the walk, 

(b) If mi = 1 then at least one, but not every arc in c/ is currently on the walk, 

(c) If mi = 2 every arc in q has been traversed at least once. 

3. The parameter t indicates the number of <r-arcs currently in the walk. 
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Algorithm D 

Initially set j := 0; i := 1; t := 0; nj := u\ n* := empty, 2 < k < 2q + 1; 
p t := empty, 1 < s < 2q\ and m* := 0, 1 < h < f. 

Cycle2(iV, P, M, C, j, t, f); end. 

Subroutine Cycle2(./V, P, M, (7, j, i, t ) 

(Dl) if [(d(i/,n,) = d e (i/,c/) = t] and [m; = 0]] where n; € cj,/ € {1,2,...,/} 
then j := / + 1 
Temp := c; 

ci := Cj 

Cj := Temp 
Temp := mi := mi + 1 
m/ := mj 
mj := Temp 

n t+ i := x where (n t , x) € Cj 

Pi := 

*:=:* + 1 
t t + 1 

Cycle2(iV , P, M,C,j,i,t) 

(D2) else if [[<f(i/,n,) = d e (i/,Cj) = (t — |cj|)] and [mj = 1]] where m € c } 
then rii + i := x where (n,-,x) € Cj 

Pi := (n„ni+i) 

i := i + 1 

mj := mj + 1 

t := t - 1 

Cycle2(iV, P, Af, C,j, i, t) 

else if [[d(u, n,) = d e (i/, Cj) = f] and [mj = 2]] where n, € Cj 
then j := j - 1 

Cycle2(iV, P, Af, C,j,i,t) 

else if mj = 1 

then rij +1 := x where (nj,x) € c ; 

Pi := (ni,n t+ i) 
i :=i + 1 
t :=t + l 

Cycle2(iV, P, Af, C,j,i,t) 

else if mj = 2 

then rii + i := x where (n;,x) € Cj 
p, := (ni,x) 

*:=* + ! 
t := f - 1 

Cycle2(iV, P, M, C, j, i, f) 
else end 


(D3) 

(D4) 

(D5) 

(D6) 
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Figure 16. A digraph to illustrate Algorithm D 


The following is a brief description of the conditional lines in Algorithm D: 


1. Line (Dl) insures that: 

(a) There are d e (v, c/) <7-arcs in the walk when c t is initially entered since 
d e {v, c t ) = t, 

(b) No arc on the cycle q has previously been traversed (m/ = 0), 

(c) The distance to the last vertex in the walk equals the edge-distance to q 
from the initial vertex z/, i.e., d(i/,n,i) = d e (i/,q). 

(The current cycle goes from Cj to c J+ i.) 

2. Line (D2) insures that at the completion of one traversal of the cycle cj another 
cycle is not entered for the first time, rather, the walk continues to traverse Cj 
for a second time. 

3. Line (D3) insures that when every arc in a cycle Cj is traversed exactly twice, 
the walk continues along the cycle c ; _i. 

4. Lines (D4) and (D5) guide the walk along the current cycle cy unless it is time 
to initially enter the cycle Cj +1 (Dl), to reenter the cycle c_,_i (D3), or to begin 
the second traversal around the cycle Cj (D2). Line (D4) is satisfied during the 
first traversal of c, while (D5) is satisfied during the second traversal of c r 

5. Line (D6) ends the procedure. 
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To .ilustrate Algorithm D, consider the digraph D in Figure 16. Let C be 
the edge-factor consisting of the cycles X — ( 07 , 09 , ag), Y = ( 01 , 05 , 06 , 010 ), and 
Z = ( 02 , 03 , 04 ). Initially set ni = vi\ p, := empty, 1 < s < 20; M = [0,0,0]; 
C = [X, Y, Z]\ j := 0; i := 1; t := 0; 

The conditional lines in Algorithm D are satisfied in the following order: 


• Cycle2(A r , P, M, C,0,1,0) is called and the conditions for line D1 are satis¬ 
fied with ci = C 2 = Y. The values for the parameters are currently j = 1 , 
C = [Y, X, Z], M = [1,0,0,... ,0], ri 2 = v 2 , p\ = ai, i = 2, and T = 1. 

• Cycle2(W, P, M, C, 1,2,1) is called and the conditions for line D1 are satis¬ 
fied with ci = C 3 = Z. The values for the parameters are currently j = 2, 
C = [X, Z, Y], M = [1,1,0], n 3 = u 3 , p 2 = < 12 , * = 2, and T - 2. 

• Cycle2 (jV, F*, Af, C, 2,2, 2 ) is called and the conditions for line D4 are satisfied. 
The values for the parameters are currently j = 2, C = [X , Z,Y], M = [ 1 , 1 , 0 ], 
«4 = « 4 , Pz — <* 3 , * = 3, and T = 3. 

• Cycle 2 (/V, P, Af, C, 2,3,3) is called and the conditions for line D4 are satisfied. 
The values for the parameters are currently j = 2, C = [X, Z, Y], M = [1,1,0], 
n 5 = u 2 , P 4 = a 4 , i = 4, and T = 4. 

• Cycle 2 (A r , F, A/,C, 2,4,4) is called and the conditions for line D2 are 
satisfied with Cj = c 2 = Y. The values for the parameters are currently 
j = 2, C — [X,Z, K], M = [1,2,0], n 6 = t> 3 , Ps = a 2 , i = 4, and T -3. 

• Cycle2(7V,F,A/,C, 2,4,3) is called and the conditions for D5 are satisfied. 
The values for the parameters are currently j = 2, C = [X, Z, T], M = [ 1 , 2 , 0 ], 
”7 = u 4 , p 6 = a 3 , i = 4, and T — 2. 

• The remaining conditional lines are satisfied as follows: D5, D3, D4, D4, D4, 
D2, D5, D5, Dl, D4, D4, D2, D5, D5, D3, D4, and D 6 ends the program. 


We find the sequence of arcs traversed to be 


P = [ai,o 2 , a 3 , a 4 , a 2 , a 3 , a 4 , a 5 , a 6 , aio, ai, 05 ? ^ 6 , 09 , as, 07 , 09 , ag, 07 , ai 0 ]. 
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We now show that Algorithm D constructs a double Eulerian cycle W whose 
measure achieves the lower bound given in Theorem IV. 16. First we need to show a 


relationship between adjacent cycles in an edge-factor of a digraph. 

Lemma IV. 17 For an edge-factor F of D = (V, A), let C € F and v a vertex 
where d e (v,C) > 0. There exists a cycle P € F adjacent to C such that 
d e (v, P) < d e (v,C). 

Proof: Let F be an edge-factor of D where C, P € F. Let d(v,x) = d e (v,C) 
where x € (C) v and {a l5 ... ,<*,} is the set of arcs incident to x. Without loss 
of generality, let and a 2 be arcs in C. Then the minimal length walk W 
between v and x does not include either of a\ and q 2 else there is a vertex 
z € (C% such that d(v, z ) < d(v, x). Without loss of gererality, let a* be the 
arc incident to i in W. Therefore, C is adjacent to a cycle P where x £ {P) v 
and d e (v, P) < d e (v,C). ■ 

We now show that the walk constructed by Algorithm D visits every edge 
exactly twice. 

Lemma IV. 18 For a given edge-factor F = {Ci,C 2 ,.. .,C/} of the digraph 
D = (V, A) and a vertex v € V, Algorithm D constructs a double Eulerian 
cycle along D beginning at vertex v. 
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Proof: We prove, by strong induction on the edge-distance to a cycle C € F 
from the initial vertex v, that each arc in D is traversed exactly twice. 

Basis Step: Let F be an edge-factor of a digraph D. The initial step of the 
walk beginning at vertex v is along a cycle Co where d e (v, C 0 ) = 0. The walk 
continues along the arcs of Co until each of the 3 criteria (on page 64) are 
satisfied for a new cycle to enter the walk. The edge-distance from v to a new 
cycle added to W v is greater than the distance of the current cycle. As there 
are only a finite number of arcs in each cycle and only a finite number of cycles, 
eventually during the traversal of any cycle there will be no adjacent cycles of 
greater edge-distance that have not been initially entered. When the current 
cycle is exited it is to a cycle of smaller edge-distance from v. This can only 
occur after each of the arcs of the current cycle have been traversed exactly 
twice. Since there are only a finite number of cycles and Co is the cycle of 
smallest distance on the walk, Co must eventually be revisited and traversed 
exactly twice. A similar argument insures that every cycle of distance 0 is 
traversed exactly twice. 

Inductive Step: We now assume that the algorithm traverses each cycle C n of 
distance n from v exactly twice for ail n < k. Let C* be a cycle of edge-distance 
k from v. By Lemma IV.17 there exists an arc a*_i of edge-distance k — 1 on 
a cycle Ci adjacent to C* where d t (v,Q) = i, 0 < i < k. Therefore, on the 
first or second traversal of Ci, the conditions for the new cycle Cjt to enter the 
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walk are satisfied and Ck is entered. From the discussion above, C* will then 


be traversed exactly twice. ■ 

We have seen that Algorithm D completes a double Eulerian cycle. We now 
discuss the number of (7-arcs existant at each step of the walk constructed by Al¬ 
gorithm D. The following lemma shows that we can obtain the lower bound of 
Theorem IV. 16. 


Lemma IV.19 Let F be an edge-factor of the digraph D = (V, A). The min¬ 
imum measure of a cycle W 2v generated by Algorithm D is ^ \C\ 2 -f p v {F), 


where p v (F) is the posture for the factor F. 


ceF 


Proof: By Lemma IV. 18, Algorithm D produces a walk W 2v that traverses 
each arc of an Eulerian digraph exactly 2 times. Furthermore, when the first 
arc, a!, in a cycle Cj of distance j from v is traversed in Algorithm D for 
the first time, there are exactly j <7-arcs along W 2v . This follows from line 
(Dl) of Algorithm D , the conditional statement governing when new cycles 
can initially enter W 2 ,. When the k th arc a* in cycle Cj is traversed for the 
first time, there are exactly j + k — l <r-arcs in W v . Therefore, the first traversal 

\c,\ 

of each arc in Cj adds + k — 1) to the measure of the walk. When the k th 

*=i 

arc of the cycle Cj is traversed for the second time, there are j + |C| + 1 — k 


<7-arcs in W 2v . Therefore, the second traversal of each of the arcs in C } adds 







\C,\ 

£(> + \C\ + 1 — k) to the measure of W 2v . It follows then that including the 

*=i 

cycle Cj in the walk W 2v contributes 

(\c\ \ (\c\ 

(]C(j + * - !)) + 1+ l^l +1 - fc) 

U± j£l -1)0 +1£1) (j - i)(i) U + \C\)(j + jgl +1) (j)(j + i) 

2 2 2 2 

= |C| 2 4- 2j\C\ to the measure of W 2v . Summing over all cycles in F yields 

Y,m 2 +a\c\)='E(\c\* + 2<uv,c)\c\)= £ icf+MF). 

C€F C€F C€F 

Hence, the minimum measure of W 2v is min |C| 2 + p v (F). U 

F C€F 


Theorem IV.20 Let F be an edge-factor of the digraph D — (V,A). The 

value of D is min ^2 |C| 2 + Pv{F), where p v {F) is the posture for the factor 
F CeF 


F. 


Proof: Follows directly from Lemma IV. 19 and Lemma IV. 16. ■ 


Corollary IV.21 Let D be an Eulerian digraph. Let W v be a double Eulerian 
cycle on D that produces the value of the graph. Then v € Mp(D). 

Proof: Follows immediately from Theorem IV.20 and the definition of Mf(D), 
the mean of the factor F. ■ 
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Distance from v 


Size of the Cycle 



1 

2 

3 

4 

5 

6 

7 

0 

1 

2 

3 

4 

5 

6 

7 

1 

3 

4 

5 

6 

7 

8 

9 

2 

5 

6 

7 

8 

9 

10 

11 

3 

7 

8 

9 

10 

11 

12 

13 

4 

9 

10 

11 

12 

13 

14 

15 


Table V. Average cost of arcs in particular cycles 

We have shown that the value, v, of an Eulerian digraph D is given by 

i/ = min£ \C\ 2 + p v (F), (IV-1) 

F C€F 

where F is an edge-factor of D. 

Table V lists the average amount each arc a adds to the measure of the walk 
when a is in a cycle of size \C\ for 1 < |C| < 7 where 0 < d e (v, C) < 4 from the initial 
vertex v. It follows from the discussion above that the value in the (i,j) position in 
Table V is 2i + j. 

We can modify algorithms that find the median of a graph in a straightforward 
manner to find the posture of a directed graph for a given edge-factor. Finding all the 
edge-factors of a directed graph, however, can be extremely difficult. It is not obvious 
how to determine a priori an edge-factor that produces the double Eulerian cycle of 
minimum measure, hence the value, of a digraph. The following two lemmas, however, 
remove certain edge-factors of a digraph D from consideration when generating a cycle 
with minimum measure. 
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Lemma IV.22 Let F be a reducible edge-factor of D = (V, A) where X € F 
can be reduced to cycles A and B. Then 

E |C|' + d,(o,C)< E |Cf + <f.(o,C). 

CgF Ai b CC.F 

Proof: If X € F can be reduced to the two cycles A and B then either 
d e (v, A) or d e (v, B) (or both) are equal to d e (v, X). Without loss of generality 
let d e (v,A ) — d e (v,X). But d e (u,B) < d e (v,X) + since B can be at 
most half way around A. It follows that 

|Af+ 2(d t (o,A-))PC| = |A+B| ! + 2(d«(u,X))|A + B| 

= M| ! + |B| S + 2|A||B| + 2(d,(o, X))|A| + 2(d e (u,2f ))|B| 
= |A| a + |B| 2 + 2(d,(u, A))|A| + 2|B|(|A| + d,(v,X )) 

> |A| 2 + |B| 2 + 2(d.(u,A))|A| + 2|B|( J|! + <(,(v.Xf'j 

> |A| 2 + 2(d.(u,A))|A1 + |B| 2 + 2(d«(o, B))1B|. 

Therefore, 

E ici 2 +d«(u,o< $:ici 2 +d t (u,c). 

C€F ab ceF 

■ 

Corollary IV.23 Let F be a reducible edge-factor of D = (V, A). Then a 
double Eulerian cycle that defines F does not produce the minimum measure 
for D. 
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Proof: Follows directly from Lemma IV.22 and Theorem IV.19. 


Equation IV. 1 suggests that minimizing the sum of (|C| 2 )’s in the edge-factor 
should reduce the measure of a double Eulerian cycle. One would expect the cycles 
in the edge-factor that yield a double Eulerian walk of minimum measure to manifest 
the same properties as cycles that minimize the sum of the squares of their lengths. 

Lemma IV.24 Let A = {Ai > A 2 > ... > A< > 0} be a partition of the 

t 

positive integer K. Then A is the partition that minimizes ^ A, 2 if and only if 

t=i 

Ai — A t < 1 

Proof: Let A be the partition that minimizes A, 2 . Assume Ai — A, > 2. 

Then 

(Aj — l) 2 + (At + l) 2 = Aj 2 -f 1 — 2Ai + At 2 + 1 + 2A t 

= -Aj 2 + At 2 + 2 + 2(At - Ai). 

Since At — A! < —2 it follows directly that (Ai) 2 -I- (A t ) 2 > (Aj — l) 2 + (X t +1) 2 - 

Therefore, A t and A t are not in the partition A that minimizes the sum of the 

squares. ■ 

Loosely speaking, a factor whose cycles are as nearly as possible equal sized 
should produce the smallest measures. In the next section we apply this heuristic to 
the Good - de Bruijn digraph to obtain the values of these digraphs. 
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D. THE VALUE OF THE DE BRUIJN DIGRAPH 

Lemma IV.24 shows that an edge-factor that minimizes the sum of the squares 
of the cycle lengths in a digraph minimizes the differences between cycle length. Every 
edge-factor of B n includes at least one cycle of length at least n -(- 1. For example, 
the (n + l)-tuple 000... 1 representing am arc in B n must belong to a cycle of length 
greater than or equal to n + 1. Additionally, if arc a, represented by the (n + l)-tuple 
of all zeros, 000... 0, is not a member of the cycle C = (0) of length 1, then it is 
an arc in a reducible cycle. Corollary IV.23 showed that reducible cycles are never 
permitted in the factor that generates a walk of minimum measure. Therefore, the 
cycle (0) will appear in the factor yielding the value of the Good - de Bruijn digraph 
B n . It follows that the minimum difference between the lengths of cycles is at least n 
for any edge-factor of B n that produces the value. Therefore, if F is an edge-factor of 
irreducible cycles in B n there have to exist cycles in F whose lengths differ by at least 
n. We would want the lengths of the cycles in F to be equal, but we now see that F 
must have a cycle of length 1 and a cycle of at least n +1. The following theorem by 
Golomb [Ref. 6] insures that the cycle sizes in the edge-factor PCRn+i differ by at 
most n. 

Theorem IV.25 (Golomb) Let the edge-factor of B n be generated by the 
PCRn+i. Then the only cycles appearing are those of length d, where d|(n +1). 

In [Ref. 6] it is shown that there are two possible cycles of length 1, (0) and 
(1), only one cycle of length 2, (10), two cycles of length 3, (Oil) and (001), and in 
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r* Jl 
d\n 

summation is extended over ah divisors of n. 


The value of B n is a function of the edge-factor of B n and also of the distances 
to the respective cycles of the factor in the underlying graph of B n . Further, the 
distance between cycles in an edge-factor of B n and the initial vertex v of the minimum 
double Eulerian cycle affect the value of B n . A closer examination of Equation IV.1 
suggests that if the distance to cycles of larger length in F from the initial vertex v 
tend to be smaller than the distances to cycles of smaller length, the edge-factor F 
should generate a smaller measure. 

The directed distance, do(x,y), between x = (x 0 ,Xi,... ,x„_i) and 

y = • • • ,y n -i) in B n is n — 7 where 7 is the longest string of consecu¬ 
tive bits where (x n _- T ,x r ,_. y+ i,...,x n _i) = (y 0 ,yi, • • • For example, in Z? 4 , 

do(0001,1001) = 4 — 1 = 3 since the longest string shared by the last consecutive 
bits in 0001 and the first consecutive bits in 1001 is the single bit, 1. It follows 
immediately that the directed distance between any two vertices in B n is at most n. 

The distance d(x,y) between x and y in B n is n — C where £ = max( 7 ,«,c) 
and 


• k is the longest string of consecutive bits satisfying the equality 

(l/n —*1 l/n-it+l i • • • ? 1/n—1) = (lO) ? • ■ • » %k— 1 )• 

• t = 2 (n — u) where u is the longest string of consecutive bits shared by both 
x and y. 

• 7 is defined above. 
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In Biy for example, d(0001,1001) = 4 — max(l,0,2) = 2. This observation 
allows for a simple way to find the edge-distance d e (v, C) in Gb„ between a vertex v 


and a cycle C generated from the PCRn+i- 

Lemma IV.26 Let F be the PCRn+i edge-factor of B n . Let 8 be the distance 
from a vertex v € V to a cycle C in F. Then there exists a directed path 
between v and x € (C) v of length 8 in B n . 

Proof: Follows directly from the discussion above. ■ 

Lemma IV.27 Let F be the edge-factor of B n generated by PCRn+i, 
v = 00... O il... 1 and a = \C\ for some C € F. Then d(v,C) > n — <x. 

Proof: If C is a short cycle in PCR„+i, i.e., \C\ = a < (n + 1) and <r\(n + 1), 

then at most a bits of any vertex in (C) v coincide with any cyclic shift of the 

n-tuple, v = 00... 0 11... 1. Therefore, the shortest path from v to C is of 
L?J r?i 

length at least n — a. ■ 

From Theorem IV.25 we see that the PCRn+i edge-factor possesses the prop¬ 
erties that should minimize the sum of the squares of the cycle lengths. From 
Lemma IV.27, we find that in the PCR„+i edge-factor the lower bound on the dis¬ 
tance between larger length cycles and the initial vertex v = .00 ... 0,11... 1. is always 

ifj r?i 

smaller than the lower bound on the distance between smaller cycles and v. 
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The following two theorems insure that the PCR« + i cycles in B n are irre¬ 
ducible. Proofs of the theorems can be found in Mykkeltveit [Ref. 20]. 

Theorem IV.28 The maximum number of adjacencies between tu arbitrary 
cycles in the PCRn+i is 2. 

Theorem IV.29 There cannot be more than one adjacency between a cycle 
with submaximal length (< n + 1) and any other cycle in the PCR n+ j. Two 
cycles, both of which have submaximal length, can not have any adjacencies. 

The ideas and concepts developed in this section and an exhaustive search of 
all double Eulerian walks in B n , 1 < n < 5 support the following conjecture: 

Conjecture IV.30 Let the PC Rn+i cycles be the edge-factor F of the digraph 
B n and v = 00... 0 11... 1 the initial vertex in a double F .ricn walk produced 

l?j r?i 

by Algorithm D. Then W v generates the minimum measure over all double 
Eulerian walks on B n . 

The PCR„+i factor can be constructed in an efficient fashion using the 6 Al¬ 
gorithm of Fredricksen and Kessler [Ref. 26]. Table VI provides the measure of the 
double Eulerian cycle W 2v generated by Algorithm D using the edge-factor of the 

PCRn+i and v = 00... 0.11 ... 1 for 1 < n < 11. The numbers for 1 < n < 4 were 
Lj j r?i 

found independently by Harper [Ref. 27], 
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n 

value 

1 

8 

2 

24 

3 

72 

4 

200 

5 

524 

6 

1,400 

7 

3,420 

8 

8,352 

9 

19,476 

10 

45,232 

11 

101,722 


Table VI. Conjectured values for B n , 1 < n < 11 



Figure 17. The Good - de Bruijn digraph Bj with a cycle generated by Algorithm D 
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It turns out that the double Eulerian cycles generated by Algorithm D using 

the edge-factor F of the PCRn+i and v = 00 ... OJ 1... 1, are not the only walks of 

Lf j r?i 

minimum measure in B n • In Figure 17 , for Example, a double Eulerian walk 

W l = (01,02,02,03,04,05,01,(13,04,06,07,05,07, 05,03,09, 

aio, on, Os, 09, aio, 012, ai 3 , an, 015, aj2,013, au, ai6, ai6, ai 5 , a n ) 

is generated by Algorithm D using the edge-factor of PCR4 and v = 001 . By an 
exhaustive search, we know this walk has the minimum measure of 72 for any double 
Eulerian cycle on B3. Wi was generated using cycles of lengths 4 , 4 , 4 , 2 , 1 , and 1. The 
double Eulerian cycle 

W'2 = (01,02,02,03,04,05, ai,a 3 , a4,06,07, ae, 07,05,03,09, 

Ol4, O15, On, ag, 09, Oi4, O16, Ol6, Ol5, 0i2, Oi 3 , Oio, Oi 2 , Oi3, Oio, Oil) 

also has a measure of 72 and uses cycles of lengths 5 , 4 , 3 , 2 , 1 , and 1. Thus, we see 
that walks of minimum measure are not restricted to the PCRn+i edge-factor. We 
have not, however, found an edge-factor in B„ where the length of the largest cycle 
is greater than n + 2 that also generates a walk of minimum measure in B n . 
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V 


A RANDOMNESS PROPERTY OF DE 
BRUIJN CYCLES 

“Living backwards, ” Altec repeated in great astonishment. “I never heard of 
such a thing.” 

U ...but there’s one great advantage in it, that one’s memory works both ways.” 

“I’m sure mine only works one way, ” Alice remarked. “I can’t remember things 
before they happen. ” 

“It’s a poor sort of memory that only works backwards, ” the Queen remarked. 

Through the looking glass, Chapter 5 


A. INTRODUCTION 

A strong statistical dependence exits between the predecessor and successor of 
each state of a binary feedback shift register (FSR). The consequences of randomly 
selecting each successor state from a choice of two possible states proves have interest¬ 
ing consequences. Selecting the successor state actually can modeled a rather simple 
Markov process, since the two possibilities cam be assigned probabilities. We show 
that the distribution of runs in a binary de Bruijn cycle coincides with the expected 
distribution of runs for a binary sequence of length 2 n generated by randomly selecting 
the successor state in a binary FSR. 

B. THE BALANCE PROPERTY 

A binary sequence 5 is saud to have the balance property if the number of l’s 


in 5 equals the number of 0’s in 5 [Ref. 6]. 












r 


Theorem V.l A de Bruijn cycle has the balance property. 

Proof: A de Bruijn cycle of length 2" has the property that every n consecu¬ 
tive bits are different on a given period. Expressed in decimal notation, each 
n-tuple can be thought of as representing an integer from 0 to 2" - 1. In this 
range there are 2 n-1 odd integers and 2 n_1 even integers. Thus, a de Bruijn 
cycle contains 2 n_1 l’s and 2 n-1 0’s and will therefore always possess the bal¬ 
ance property. ■ 

By a simple counting argument we find that the size of the set of all balanced 
binary sequences of length 2 n is where = C(n,r) is the binomial coefficient 

of n things taken r at a time. A de Bruijn cycle can be normalized by requiring that 
the sequence begin with exactly n consecutive 0’s. There are then (normalized 
sequences of length 2 n that possess the balance property. This follows since the n 
consecutive 0’s must be preceded and followed by a 1. 

For example, when searching the 2 5 -long binary sequences for the 2,048 de Bruijn 
cycles, there are = 60,108,390 sequences of length 2 s that possess the balance 
property. There are, however, only (jils) = 4,457,400 normalized balanced se¬ 
quences of length 2 5 . 


82 








C. THE RUN PROPERTY 

A binary sequence 5 is said to have the rtm property if among the runs of 
ZEROs (and ONEs) in S, one-half of the runs are of length one, one-fourth are of 
length two, one-eighth are of length three, and so on, as long as these fractions provide 
an integer value for the number of runs. 

Theorem V.2 A de Bruijn cycle has the run property. 

Proof: The run structure of a de Bruijn cycle S„ can be completely deter¬ 
mined. A run of ZEROs of length n occurs exactly once in 5„. This run 
of ZEROs of length n must be preceded and followed by a 1, or the n-tuple 
(00- -00) would appear at least twice in S n . The n-tuple (100...0) of a 1 
followed by (n — 1) 0’s occurs exactly once in the sequence. This, however, is 
already accounted for by the run of ZEROs of length n. Thus, there is no run 
of ZEROs of length n — 1. To find the number of runs of ZEROs of length k, 
for 1 < k < n — 2, we consider all n consecutive terms of the sequence that 
begin with a 1, then the fc-tuple (00.. .0), and then a 1. Each such run can be 
made to correspond to an n-tuple t of the form 

t = 1 00... 0. 1 xx ... i , 

k n—k—2 

where the z’s are chosen as arbitrary bits. Since we are free to choose each of 
the remaining n — k — 2 components, there are 2 n-fc “ 2 runs of ZEROs of length 
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n Run Distribution: D n 

1 "01 

2 { 2 } 

3 {1,3} 

4 { 1 , 1 , 2 , 4 } 

5 {1,1,1,1,2,2,3,5} 

6 { 1 , 1 , 1 , 1 , 1 , 1 , 1 , 1 , 2 , 2 , 2 , 2 , 3 , 3 , 4 , 6 } 

Table VII. Distribution of runs in S n 

k for 1 < k < n — 2. With the single run of ZEROs of length n, the result 
follows. The same distribution holds for runs of ONEs. ■ 

The multiset of the lengths of the runs of ZEROs (ONEs) in a 2"-long de Bruijn 
sequence is denoted by D n . Table VII is a list of the ZERO and ONE run distribu¬ 
tions, D n , for a sequence 5„ € S n , where 1 < n < 6. Any sequence of length 2 n 
whose run distribution is a permutation of the multiset D n has the run property. We 
alternatively indicate such multisets by specifying the number of times each different 
type of element occurs. Thus, D 5 can also be denoted by {4 • 1,2 • 2,1 • 3,1 • 5} where 
4, 2, 1, and 1 are the repetition numbers of the entries 1,2,3, and 5, respectively. The 

repetition number of each element k € D n is denoted by rp„(fc), where 

’ 

2 n-fc -2 if 1 < k < n - 2 

ro n (k) = (V.l) 

1 if k = n. 

k 

A binary sequence can be interpreted as a sequence of integers representing 
the lengths of the alternating runs of ZERO’S and runs of ONE’s in the sequence. We 
have seen that a binary 2 n -long sequence 5 corresponds to the run sequence R [See 
Chapter III, Section C]. The run sequence R consists of the subsequence Z n of the 
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lengths of the runs of ZERO’S interleaved with the subsequence O n of the lengths of 
the runs of ONE’s. The sequences Z n and O n for any de Bruijn cycle S n of length 2 n 
will always be permutations of D n because of the run properties of S„. 

In an arbitrary binary sequence of length 2" possessing the run property, there 
are a , ja J* 2 „_ 3 ) = ways that the runs of ZEROs may be arranged. 

We can, however, normalize the binary sequence (i.e., let it begin with the string of n 
consecutive ZEROs), and then there are only Q, ways to position the 2 n-2 

runs of ZEROs. Hence, there are 

( \( 2 n-2 — 1 \ 

\ 2 °, 2 1 , 2 2 ,... , 2 n_3 / \2*,2 2 ,... , 2 n ~ 3 y 

normalized binary sequences of length 2 n possessing the run property where the first 
factor counts the runs of ONEs and the second factor counts the runs of ZEROs. 

As an example, in a normalized binary sequence of length 2 5 possessing the 
run property, there axe ^ 4 ) = 105 ways that the runs of ZEROs may be arranged. 
The runs of ONEs can be independently arranged in ® 4 ) = 840 ways. Hence, the 
2,048 de Bruijn cycles of length 2 5 are a subset of the 105 x 840 = 88 ,200 normalized 
binary sequences possessing the run property. 

Clearly, not every permutation of Z n and O n generates a de Bruijn cycle. 
Interestingly, there is no de Bruijn cycle of length 2 s with either Z 5 ot O 5 equal to 
(51312121), (51121132), (51123211), (51212131), (51213121), (52113112), or 
(52311211). This begs the question of why specific permutations of D n cannot occur 
as Z n or O n in a de Bruijn cycle that is answered in the following section. (Note: 
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When S n is viewed as a cycle, care must be taken not to break the run sequences 
across a run when the sequence is cycled.) 

D. A RANDOMNESS PROPERTY 

A binary sequence of length 2 n having the balance and run property is not 
necessarily a de Bruijn cycle. For example, the balanced sequence 0000111100110101 
has the run property, but clearly this is not a de Bruijn cycle. We show that if the 
run sequences Z n and Q n of a balance binary sequence with the run property are 
arranged randomly , the resulting binary sequence is de Bruijn. We define what we 
mean by randomly in the sequel. 

The simplest possible non-trivial experiment is one that may result in either of 
two possibilities. Such an experiment is called a Bernoulli trial and the two outcomes 
are labeled as 1 or 0 (success or failure). This framework is used in what follows. 

Let A and B be copies of the multiset D n . Suppose the elements from the 
multiset A are viewed as being placed randomly between the positions of another 
circular permutation of the second multiset B. The interleaving of A and B generates 
a run sequence R. (We view A as the multiset of the lengths of the runs of ZEROs 
and B as the multiset of the lengths of runs of ONEs.) Given an arbitrary element j 
in Z?, let X\ be a binary indicator variable defined on j satisfying X\ = 1 if and only 
if an element 1 from A is to the immediate right (TTIR) of j , otherwise X\ — 0. The 
probability function for the discrete random variable X x is 

p Xl (x) * P(X i = x), for x € {0,1}. 
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Since the element 1 € A accounts for 2 n-2 of the 2 n_3 elements in D„, it follows 
immediately that p x ,(l) = .5 = p x ,(0). The expected value for X\ is 

E[X i]= £ xp Xl (x) = 0(.5) + l(.5) = (.5). 

*€{ 0 , 1 } 

The expected value of X\ is the probability that a 1 € A is TTIR of an element in 
B. We know that the expected value of a sum is the sum of the expected values. The 
expected number of l’s of A that are TTIR of all of the entries j in B is denoted 
E[l;j] — \rD n {j), where ro n (j) is the repetition number of the element j in the 
multiset D n . From Equation V.l, it follows that 

2 n -j-3 if l < j < n - 2 

£[i; j] = - (v.2) 

2- 1 ifj = n. 

£[1; j] is an integer except when j is either (n-2) or n. We find it necessary to specify 
the expected number of l’s that are TTIR of (n - 2) 21 TTIR of (n) in B, denoted 
£[1; n - 2, n], so that an integer value is realized. We then find £[1; n - 2, n] = 1. 

Similarly, we define Xj on j € B to be a binary indicator variable where 
X-i = 1 if and only if a 2 in the multiset A is TTIR of j, otherwise X 2 = 0. It follows 
immediately that p Xj (l) = .25, p x ,(0) = .75 and E[X 2 ] = 0.25. 

The expected number of 2’s that are TTIR of all the j’s in B, denoted £[2;j] 
is equal to (.25)ro n (j). From Equation V.l, it follows that 

{ 2 "-i-« if 1 < j < n - 2 

(V.3) 

2~ 2 if j = n. 
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In genera], we define the random variable Xk on j € B to be a binary indicator 
variable where Xk = 1 if and only if a k is TTIR of j, otherwise Xk = 0. The 
probability function for Xk is given as 

2 ~ k if 1 < k < n - 2 

Px k ( 1) = (V.4) 

2~ k+2 if fc = n, 

and P Xk ( 0 ) = 1 - Px„(i)- The expected value of Xk is given by 

{ 2 ~ k if 1 < k < n - 2 

(V.5) 

2 -‘ +! if fc = n. 

The following Lemma provides the expected number of elements TTIR of each 
type of element when random permutations of D n are interleaved. 


Lemma V.3 If elements from A — D n and B = D n are placed randomly in 
alternate order, TTIR of all the j ’$ in B we expect to find exactly E[k,j) k’s 
in A, with 


E[k,j\ = 


2 *»— j—k —2 

if 1 < j, k < n — 2 

2 ~ k 

If l<fc<n — 2 and j = n 

2~ 3 

if k — n and 1 < j < n — 2 

2 ~ k+2 

ifkj = n 


(V. 6 ) 


and where the multiset D n represents the run distribution for a 2 n -long binary 
sequence possessing the run property. 

Proof: Follows directly from the Equations V.l and V.5 ■ 
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We define the multiset 


/?! = {£[l;l|-l,£[2;l]-2,...,£[n-3;l)-n-3}, 

where Ri is the submultiset of integers of A that represent the elements that are 
expected to be TTIR of all the l’s in B. Recall that E[k; j] = £[Xfc]ro n (j). Therefore, 

Ri = {£[X,]r 0n ( 1) • 1, E[X 2 ]r Dn (\) • 2,..., E[X n . 3 )r Dn (l) • n - 3}. 

In like manner, i? 2 — {£[1; 2] • 1, £[2; 2] • 2,..., £[n — 4; 2) • n — 4}. 

We define the multiset Rk = {£[1; fc]*l,£[2;fc]*2, . ..,£[n — k — 2;fc]-fc}, where 
1 < k < n — 2. The multiset Rk is the submultiset of integers of A that represents 
the elements that axe expected to be TTIR of all the Ar’s in B. 

In general, let a be a submultiset of B and define R a to be the multiset of 
integers that represents the elements in A that are expected to be TTIR of all the 
elements in a. If a and are multisets in B, Ra = Rp if and only if |a| and \(3\. 

In a similar manner, we denote the multiset of integers that represent the 
expected elements from A that are TTIR of all the j{ s, j 2 ’s, ..., j q ' s, in B by 
fyjithi—Ji)' We define R^ = R(k+i,k+ 2 ,...,n- 2 ,n)- 

Lemma V.4 R% = R k for every 1 < k < n — 2 

Proof: We need to show that \k\ = \k\. By Equation V.l, ro n (k) = 2 n ~ k ~ 2 
for 1 < k < n — 2 and r £ j n (fc + l) + --- + r £ j n (n- 2 ) + rD„(n) = (£!fe=i - 3 2 *) + l. 

n-fc-3 

Let 2 fc = 5. Then 2 S — S = S = 2 n ~ k ~ 2 — 1 . It follows immediately that 
fc =i 

r D n (k) = r D„(k + 1) + • • • + r Dn (n - 2) 4- r Dn (n). Therefore, £* = R k . ■ 
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As an example, let E% be a binary sequence whose run sequence consists of 
alternating terms from two respective multisets D 6 that have been randomly inter¬ 
leaved. Let 2q be the sequence of run lengths of ZEROs and 0$ the sequence of run 
lengths of ONEs. We would expect the following: 

1 . Exactly four l’s, two 2’s, and one 3 in are TTIR of the l’s in O s - 

R, = {1,1,1,1,2,2,3}. 

2. Exactly two l’s and one 2 in Zq are TTIR of the 2’s in Oe. Ri — {1,1,2}. 

3. Exactly one 1 in Z& is TTIR of the 3’s in Os- R 3 = {1}. 

4. Exactly one 1 in A is TTIR of either the 4 or 6 in Oe- R(4.e) — {!}• 

Similarly, we may continue to determine the expected values of the elements that 
must be TTIR of the elements in the multisets R\ and R 2 . For example, we define 
the multiset 

R 1/1 = {EIXJEIU 1 ] • 1 , E[X x )E[2\ 1 ] • 2 ,..., £[X,]E[n - 4; 1 ] • n - 4}, 

where R1/1 is the multiset of integers that represents the elements expected TTIR of 
all the l’s in Ri. 

In general we define 

R., = {(£[*i))‘' 1 £[!;*,) • MfilA-,])*-' -2. 

where p = n — xi — 22 * • * — x* — 2. The range of values for p insures integer values 
for the representative numbers in R Xi /.../ Xk - It follows from Equation V .6 that 

Rx m /.../ Xl = { 1 } when xi + • • • + x m = n - 3. (V.7) 
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From Equation V.6, one sees immediately that E[k,j] = E\j,k]. Therefore, since 
multiplication is commutative, 


Exi/.../xk Ryi /•••/»* 

where yi ,... y* is any permutation of xi,... x*. Hence, 

Rxi/.../x k = ^(*I+-+X*)- (^*8) 

Continuing the above example, we also expect to find the following in E$: 

1. Exactly two l’s and one 2 in Oe are TTIR of the l’s in R\. R x /i = {1,1,2}. 

2. Exactly one 1 in Oe is TTIR of the 2’s in /?i- Ri/ 2 = {1}. 

3. Exactly one 1 in Oe is TTIR of the l’s in R 2 . R 2 /i = {1}. 

This process can continues until R Xl /.../x„ — {1}. This will occur when 

xi 4--h x m = n - 3. 

Concluding the example, we also expect exactly one 1 in Oe TTIR of the 
two l’s in R\/ 1 , therefore, R\/i/i = {1}. Furthermore, since R\ = {1,1,1,1,2,2,3}, 
it follows that Rj = R( 2 , 3 , 4 , 6 ) is also equal to {1,1,1,1,2,2,3}. In a like manner, 
R$ = R( 3 , 4 , 6 ) = {1»1»2}, etc. 

A binary sequence whose run structure consists of alternating terms from D n 
with the expected distribution of elements to the immediate right and left of each like 
of element in D n , is said to have the Expected Value Property. 
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A binary sequence of length 2 n with the Expected Value Property exhibits the 
following: 

Hi = {1,..., 1,2,... ,2,3,...,3,... (n — 4),(n — 3)}, 


2«—4 2 n “* 5 2 n ~® 


2 l 


% = Hi, 


H 2 = {l,...,l,2,...,2,...(n —5),(» — 4)}, 


Jn-S 2 n “® 2 l 


H 2 = H 2 , 


H* = 1 , 2,..., 2 ,... (n - (fe + 3)),(n - (A: + 2)}, 

2 n-(*+3) 2 n-(*+4) 2 

= H fc 


Hn— 3 = {1}, and 
H n -3 = R(n—2,n) = Rn—3- 

Equations V.7 and V.8 have far-reaching implications. In particular, they provide a 
connection between the Expected Value Property and de Bruijn Sequences. 

Theorem V.5 A binary sequence S of length 2” is a de Bruijn cycle if and 
only if the run sequences of S possess the Expected Value Property. 

Proof: Let 5 be a binary sequence with the Expected Value Property. Since 
5 consists of alternating terms from two copies of D n it follows immediately 
that S has length 2 n . 
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Let S J be the sequence formed by cyclically shifting 5 by j bits to the left. The 
sequence 5 has 2 n distinct n-tuples if and only if for every j , 1 < j < 2 n , the 
sequence formed by the sum S J © S does not contain a string of n consecutive 
0 ’s. 

Let 5 possess the interleaved run sequences O n and 2„, each of which is a 
separate permutation of D n , for 1 < j < 2”. Let T be the longest consecutive 
string of 0’s from the sum sequence S j © S. The length of T is |!T| = t. Let 
R = (r t ,r 2 ,... ,r t _i,r t ) and W = (u>i,u> 2 ,... be the consecutive bits 

from S> and 5, respectively, such that R ®W = T. 

Since T is the longest consecutive string of zeros, the bit r 0 that immediately 
precedes R and the bit w 0 that immediately precedes W must differ. Without 
loss of generality let r 0 = n = 1. Define k by w\ = w 2 = • • • = = Wk but 

w k ^ Wk+ 1 - Then k is the length of a run of ONEs in W. 

We now show by contradiction that the sequences 5 has no repeated n-tuples. 
Since the run lengths in 5 are elements in D„, k < n — 2, we know that 

R(k,k+i .n— 2 ,n) = Rk- 1 - With A: — 1 < n — 3, the properties on page 92 and 

equation V.8 can be used. Let ii, i 2 ,..., x m be the lengths of the runs that 
coincide after the bits r k and n;*. Let x p , where p < m be the length of the 

last runs in R and W that concide, where (k — 1) + x\ -|- f- x n < (n — 3) and 

(k — l) + xi +-HXp+Xp+i >(n — 3). Let (Ar —l)-f-Xi-)-1-x p = n — d, where 
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d > 3. Since Rk-i/xi/.../x p = Rn-d , it follows that x p+ i > d — 3. The length of 
the longest run that coincides between R and W TTIR of Rn-d, however, is 

d— 3. We find, therefore, that (fc—1)4-ii +-hx m = n—g < n — 3. The length 

of the longest string of bits that coincides TTIR of Rn_ a , however, is g — 2. 

Since (k— l)4-xH-1-x m = n-g , it follows that k+x H- \-x m +g -2 = n-1 

is the length of the longest string of bits that coincide. Therefore, 5 has no 
repeated n-tuples and is therefore a de Bruijn cycle. 

Conversely, let S be a de Bruijn cycle. Since S is de Bruijn, it consists of 
alternating terms from D n and S has the Run Property. Consider the runs 
of ONEs of length r where 0 < r < n — 2. Each such run can be made to 
correspond to an n-tuple along the sequence S of the form 

0.11 • • • 10 xx • • • x , 

r n-r-2 

where the x’s are arbitrary bits. The number of runs of ZEROs of length j 
where n — r—j > 0 that are TTIR of the runs of ONEs of length r is 2 n-r-J-2 . 
This coii. .ides with the expected integer values given by Lemma V.6. Hence, 
the Expected Value Property for S follows immediately from the Run Property 
and the uniqueness of the n-tuples in a de Bruijn cycle. Therefore, a binary 
sequence of length 2 n with the expected value property is a de Bruijn cycle. ■ 
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Interleaving any permutation of the run sequences of D n for 1 < n < 3 gen¬ 
erates a de Bruijn cycle. For n = 4 we find that a sequence with the expected value 
property must satisfy the following condition with respect to its run sequences: if the 
sequence of runs of ONEs/ZEROs has 2 consecutive l’s in it then the sequence of runs 
of ZEROs/ONEs cannot have 2 consecutive l’s. This condition provides a very simple 
way to generate the 16 de Bruijn cycles of length 2 n . There are 3 different normalized 
permutations of D 4 : (4,2,1,1), (4,1,1,2), and (4,1,2,1). There are 8 permutations of 
D 4 with consecutive l’s : (4, 2 , 1 , 1 ), (4, 1 , 1 , 2 ), ( 2 ,4, 1 , 1 ), ( 2 , 1 , 1 ,4), (1,4, 2 , 1 ), ( 1 , 2 ,4,1), 
(1,1,4,2), and (1,1,2,4). Finally, there are 4 permutations of D 4 with no consecutive 
l’s: (4,1,2,1), (2,1,4,1), (1,4,1,2), and (1,2,1,4). The 16 de Bruijn cycles are generated 
by: 


1 . Interleaving the two normalized runs sequences from D 4 having consecutive 
l’s with the four runs sequences from D 4 not having consecutive l’s, and 

2. Interleaving the single normalized run sequence from D 4 having no consecutive 
l’s with the eight runs sequences from D 4 having consecutive l’s. 


Table VIII displays these sequences. 

In the general case, although the expected value property can be used to 
generate de Bruijn cycles, it does not provide a more efficient means to construct 
them than appears elsewhere. 
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4 2 11 

4,1,2,1 

1111000011010010 


2,1,4,1 

1111001101000010 


1,4,1,2 

1111011000010100 


1,2,1,4 

1111011001010000 

4,U,2 

4,1,2,1 

1111000010100110 


2,1,4,1 

1111001010000110 


1,4,1,2 

1111010000101100 


1,2,1,4 

1111010010110000 

4,1,2,1 

4,2,1,1 

1111000010011010 


4,1,1,2 

1111000010110100 


2 ,4,1,1 

1111001000011010 


2,1,1,4 

1111001010110000 


1,4,2,1 

1111010000110010 


1 ,2,4,1 

1111010011000010 


U,4,2 

1111010110000100 


1 ,1,2,4 

1111010110010000 


Table VIII. The de Bruijn cycles of length 2* generated by run sequences 
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VI. A MEMORYLESS ALGORITHM TO 
GENERATE DE BRUIJN SEQUENCES 


“To change and change for the better are two different things. ” 

German Proverb 


A. INTRODUCTION 

In this chapter we present an algorithm to construct a subset of all the binary 
de Bruijn sequences of length 2 n+1 . A comprehensive survey of previous work on this 
subject can be found in [Ref. 28]. A common approach to this process is to join 
cycles together to form a full cycle. This same general approach will be followed here. 
The algorithm presented is not limited to joining pure cycles, however, using the PCR 
feedback function and its pure cycles produces the most efficient algorithm. Unlike the 
Universal Algorithm [Ref. 29], the representative vertex for each cycle is determined 
by a distance function on the digraph B n , rather than by finding the smallest element 
among all the vertices on a cycle. Our algorithm produces an Eulerian cycle in B„. 
As this is isomorphic to a Hamiltonian cycle in B n + i, our algorithm allows for an 
interplay between vertices and arcs that provides a systematic way to join cycles. 
As a consequence, we decompose the Good - de Bruijn digraph into an edge-factor 
to insure each arc is traversed, rather than into a factor that partitions the set of 
vertices. 
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B. BACKGROUND FOR SIMILAR APPROACHES 

Golomb [Ref. 6] describes the key-sequence method to generate de Bruijn se¬ 
quences. One generates the set of span n de Bruijn sequences, S n , from preference 
functions (P*, P 2 *) that are obtained from feedback formulas and from the preference 
functions (Pi, P 2 ) for the span n — 1 sequences. The key-sequence method requires 
considerable memory storage to generate the sequences because every de Bruijn se¬ 
quence of smaller span is utilized in order to generate the de Bruijn sequences of 
span n. 

Fredricksen [Ref. 30] shows how to generate 2 2n-5 full cycles of length 2 n from 
the prefer ones sequence using the PCRn. The method uses 6n bits of storage, and 
n units of time to produce the next bit from a given n bits. 

Etzion and Lempel [Ref. 31] show how to generate full cycles of length 

2 n from the PCRn using 3n -I- (k)g(n, k) bits of storage, where k is a constant in the 
range 1 < fc < 2V )Sf ( n ,l:) ss (n-21og k)(l- l+Iog k ^' t* me required to produce 
the next bit from the last n bits is O(n). 

C. JOINING OF CYCLES 

Common to all of the above techniques is the notion of cycle joining. When 
the two possible predecessors of a state both map into the same state by the feedback 
function / we say / is singular. Any feedback function that is not singular is therefore 
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nonsingular. The following are equivalent conditions: 


1 . / is nonsingular. 

2 . / produces no branches. 

3. / is a one to one mapping. 

4. / is an onto mapping. 

5. Every state has a unique predecessor and a unique successor. 

6 . / produces only cycles. 

The following theorem [Ref. 6] is stated without proof: 

Theorem VI. 1 (Golomb) A necessary and sufficient condition for distinct 
states to have distinct successors is that the corresponding (nonsingular) feed¬ 
back function f: J3 n+1 —► {0,1} can be written 

f{Zo, Z\ » • • •» Zn) = Zo ® fl(Zl, Z], ..., z n ), (VI. 1) 

for some function f\: B n -* {0,1} 

Any edge-factor of the digraph B n represents a feedback function that satisfies 
equation (VI.l). Furthermore, any feedback function satisfying (VI.l) will have dis¬ 
tinct successors and the function therefore yields an edge-factor in the corresponding 
Good - de Bruijn digraph. 

Since B n is a 2-regular strongly connected digraph, when B n is decomposed 
into an edge-factor F containing two or more cycles, each distinct cycle is adjacent 
(i.e., arc disjoint and sharing a common vertex) to some other cycle. By the following 
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lemma, it is always possible to join together two adjacent cycles in a edge-factor to 
form a single cycle. 


Lemma VI.2 Let F be an edge-factor of the digraph D. Let (R) and (T) be 
distinct adjacent cycles in F with x € (R)v and x € (T) v . The cycles (R) and 
(T) are joined into a single cycle when the respective successors of the vertex 
x in each cycle are interchanged. 

Proof: Let the following sequence of vertices, (R) = (r*i,r 2 ,... ,r/) and 
(T) = (ti, f 2 ,..., t m ), represent adjacent cycles. Since the cycles are adjacent 
there is a vertex x that lies in each of ( R) v and (T) v . Let r, = tj = x. The 
sequence of vertices (n, r 2 ,..., r„ t J+1 , £j +2 ,..., tj , r< +1 ,..., n) created by ex¬ 
changing the successors of i.e., exchanging outgoing arcs of x, forms a single 
cycle. ® 

To illustrate Lemma VI.2 consider the PCR3 where f(zo, ^i, 22 ) = zo on B 2 
shown in Figure 18. This function generates the edge-factor of Figure 19A. The cycles, 
represented by a sequence of arcs, are: 

1 . Ci = ( 0 ) 

2. C 2 = (001) 

3. C 3 = (Oil) 

4. C< = (1) 
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Figure 18. The Good - de Bruijn digraph Bi as one edge-factor 



Figure 19. Edge-factors 
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The cycles C 2 and C 3 contain the common vertices 01 and 10. By Lemma VI.2 
a single cycle is formed when the successors, 10 and 11, of the vertex 01 are inter¬ 
changed between the cycles C 2 and C3, respectively. The cycles now, represented by 
a sequence of arcs, axe: 

1. CJ = (0) 

2. C 2 = (101001) 

3. C5«( 1) 

This new edge-factor is shown in Figure 19B. Therefore, the problem of joining 
cycles from an edge-factor reduces to finding the vertices common between adjacent 
cycles. 

To construct the algorithm that generates a de Bruijn sequence, a representa¬ 
tive vertex for each cycle plays an important role and must be identified. The cycle 
representative for each cycle is any vertex in the cycle that has the smallest directed 
distance from a designated vertex vp in B n . (The designated vertex vq is an arbitrary 
but fixed vertex in B n .) The cycle representative (C,)/i of the cycle (C,) is defined to 
be any vertex t>< € {Ci) v such that for the given designated vertex vq € V, 

do(vD,vi) = min ( 1 d(vd,v). 
v€(Ci). 

If there is more than one vertex on a cycle of minimum directed distance from up, we 
can define a unique representative by selecting, say, the vertex of largest/smallest dec¬ 
imal value. (Recall that the directed distance do{x, y) between x = (xo,xi,...,x„_i) 
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and y = (yo»yi, • • • »3/n-i) in B n is n — 7 , where 7 is the longest string of consecutive 
bits where in —*+ 1 ,..., in—i ) = (yoi 2/ii * ■ • * y-i —1 )•) 

Lemma VI.3 shows that cycle representatives are unique to a cycle unless the 
representative is the designated vertex vp. 

Lemma VI.3 For an edge-factor F = {Ci,Ca,...,C m } of B n = (V,/4), let 
Ci € F where dp(vp,(Ci) v ) > 0 and vp is the designated vertex. Then there 
exists a cycle Cj € F adjacent to Ci such that dp(vp,(C } ) v ) < dp(vp, {Ci) v ). 

Proof: The proof is analogous to the proof of Lemma IV. 17. ■ 

Corollary VI.4 Every vertex that is the representative of its cycle but not 
equal to the designated vertex vp is incident with exactly two cycles in any 
edge-factor. 

Proof: This follows immediately from Lemma VI. 3. ■ 

Lemma VI.3 can be applied with distance functions other than the directed 
distance from a designated vertex. The representative vertex (Cj)/} of the cycle Ci can 
be defined as either the vertex for which dp{{Ci)R,vp,) = min v6 (c.)„ d£»((C;)fl, i>£>) or 
the vertex for which d(vp, (C,)r) = min^c,). d(vp, (C,)/?). In either of these cases 
the properties of the representative vertex remain the same and the concepts in the 
next section can be applied. 
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D. THE CYCLE JOINING ALGORITHM 

Consider a feedback shift register with n + 1 stages. The tth state of the 
FSRn+i is denoted by 5, = (s,, Sj+i,..., $<+„). Given a nonsingular feedback function 
/ on the register, the next state following 5, = (s,, s l+ j,... , s, +n ) is obtained from 
the Cycle Joining Algorithm as follows: 

Cycle Joining Algorithm 

Input: A feedback function / for a FSRn+i , the current state 5, = (s,, s,+j,..., s l+ „) 
of the register, and a list of representative vertices. 

Output: The next state Si+ 

(Cl) if (s 1+1 ,... ,3j+ n ) designates a representative vertex, 
then 5 i+ x := /($<) ® 1 
else S <+ 1 = f(Si). 

(This cycle joining algorithm is analogous to others, however, it was originally used 
by Fredricksen [Ref. 32].) 

We can apply Lemmas VI.2 and VI.3 to provide a simple way to construct a 
de Bruijn cycle of length 2 n+1 . At each shift of the FSRn+i we check the last n bits in 
the register. The n bits represent a vertex in B n . If the vertex is a cycle representative 
and this is the first occurrence, we expand the current cycle, by joining an adjacent 
cycle in the edge-factor to the current cycle. If the vertex is not a cycle representative 
we continue along the current cycle. If the vertex is a cycle representative and this 
is the second occurrence, we close the current cycle and enter a previous cycle. The 
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algorithm constructs an Eulerian walk W VD along B n by successively selecting cycles 
from an edge-factor to be added and traversed. This is accomplished by stepping along 
a nonsingular shift register until the current state identifies a cycle representative at 
which time the Cycle Joining Algorithm is executed and a new cycle is added. 

We now describe an algorithm (Algorithm H) that constructs a Eulerian walk 
along B„ beginning at the designated vertex vq- Algorithm H adjoins cycles from an 
edge-factor F to create the Eulerian walk. This algorithm is similar to other common 
algorithms [Ref. 29] that join cycles from a factor and hence create a Hamiltonian 
cycle along B n . Here the edge-factor method allows us to use the properties of 
distance in the digraph to uniquely identify a common vertex on adjacent cycles 
in B„. Loosely, speaking, Algorithm H moves forward, entering and adding cycles 
at a cycle representative of increasingly greater distance from the vertex vd for as 
long as this is possible. Each cycle representative is on exactly two cycles. Algorithm 
H enters a cycle representative ( D)n for the first time along an arc on a cycle C 
where doivp, (C)„) < do{vo,{D) v ). The algorithm immediately departs the vertex 
( D)r along am arc on the cycle D. When the cycle representative {D)r is entered for 
the second time, it is along an arc on the cycle D and the vertex ( D)r is departed 
along am arc on the cycle C. When it is no longer possible to enter a new cycle at 
a greater distance, the algorithm traverses each arc on the last adjoined cycle and 
then backtracks to the previous cycle at its cycle representative. The backtracking 
continues to “close” previously entered cycles until the first cycle from which it is 
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possible to again go forward to enter and add new cycles. A cycle C could conceivably 
be entered at several places along the walk. Algorithm H, however, does not allow 
a cycle C to be entered for the first time at any vertex other than at the cycle 
representative ( C)r. 

ALGORITHM II [Constructs an Eulerian walk along B n ] 

Input: A Good - de Bruijn digraph B n = (V, A), a designated vertex vp € V where 
vd = (t>i, v 2 ,. -., u„) € 5", and a shift register with a nonsingular feedback function 
/ of degree n + 1. 

Output: A sequence (0,si,$ 2 ,...,s 2 n+i_i) constituting an Eulerian cycle through 
B n . 

Algorithm H 

Initially set the register to Si = (0, i/ 2 ,..., v n ) = (si, s 2 ,..., s„). Step the register 

once to produce S 2 = («i, s 2 ,..., s n , f(Si ® 1)), and set i = 3. 

HI Si-i = ... ,s,+n) lies on cycle C. We examine each of the states 

on the cycle until a state is found that identifies a vertex of greater distance 
from the designated vertex than the current state. If no such state exists 
(i.e. the current state identifies the cycle representative), then go to [Hla], 
else go to [Hlb]. 

Hla Si = f(Si-i)); go to [H2]. 

Hlb Sj — (^<— 1 , Si,. . •, , /(Si_i) © 1)", go to [H2]. 

H2 Increment i. 

H2a If i < 2 n+1 go to [Hi]; else end. 

We now show that the walk constructed by Algorithm H visits every edge in 
B n exactly once. 
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Corollary VI.5 Let F = {Ci,C 2 ,...,C m } be an edge-factor of B n = (V, A) 
and vq is the designated vertex. Furthermore, let vp be a vertex in exactly one 
cycle in F. We define a graph Tjg n as follows: the vertices in Tb„ are the cycle 
representatives of the cycles in F, and there is an edge between vertices x and 
y in Tb„ if and only if x,y € (<?,•)„ for some cycle Ci € F. Then T is a tree. 

Proof: The order of Ts n is m, the number of cycles in F. By Theorem VI.4 
for each cycle C, € F where dz>(no, (C'i)v) > 0 there exists a unique cycle 
Cj 6 F adjacent to Ci such that cId(vd, (Cj) v ) < d[>(v, (C,) V C). Therefore, 

T Bn is connected and the size of the edge set of Tb„ is m — 1. Hence, Tb„ is a 
tree. In like manner, in the case where vq is in exactly 2 cycles in F, Tb„ is 
also a tree. ■ 

Thus, given an edge-factor F of the digraph B n = (V, A) and a vertex vp € V , 
Algorithm H joins every cycle in F and thereby constructs an Eulerian walk along 
B n beginning at the vertex vq. (Algorithm H is in fact a depth first search of 7b„.) 
Since Tb b is connected, every arc is traversed exactly once. 

To illustrate the algorithm, consider a FSR$ and the pure cycling register 
defined by the feedback function f{z 0 , zi,Z 2 , 23 , 24 ) = z 0 . Let the designated vertex 
vd in B 4 be 0011. The edge-factor contains the eight cycles and their respective 
rep ’sentatives as shown in Table IX. (Note: the algorithm does not compute the 
table of representatives a priori , rather, representatives are computed on line.) The 
cycle (00101) has two vertices 1001 and 1010 each of the same minimum directed 
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Table IX. Cycles and representatives for a PCRs, where vq = 0011 


distance from vd• We arbitrarily select the vertex 1010 to be the representative, as 
it is of largest decimal value. Applying the Cycle Joining Algorithm to this FSRs, 
t.ie cycles are joined together to form one cycle, shown in Figure 20. The de Bruijn 
sequence representing this cycle is 

00011111011100110100101011000001 . 

Changing the cycle representative of (00101) to 1001, the cycles are joined to 
form a different sequence 

00011111011100101001101011000001 . 

Continuing the example with the same feedback function, when the designated vertex 
is changed to 1001 and using the smallest representative 0101 for the cycle (01011) a 
different sequence is obtained 

01001111101110010110101000110000. 
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Finally, with up = 1001 and selecting 0110 as the representative of the cycle (01011) 
we get the sequence 

01001111101110010100011010110000. 

Each of these sequences are different (i.e., none is a cyclic shift of another). It is not 
always the case that distinct vd's yield different sequences. Clearly, identical sets 
of representative vertices generate identical sequences. The size of the class of de 
Bruijn sequences generated by the pure cycling register using Algorithm H has not 
been determined. Using just the feedback function of PCR 4, however, 12 of the 16 
de Bruijn sequences of length 16 can be obtained by changing the vertex up. 

E. THE FEEDBACK FUNCTIONS 

Using the PC Rn, the Cycle Joining Algorithm requires storage of n bits for the 
current state Si, n bits for cycling the current state, and n bits to store the current 
representative on a cycle, for 3n bits of storage. The greyest amount of time in this 
algorithm is spent to determine the representative vertex on each cycle. The worst 
case occurs when the current state is the cycle representative and the entire cycle 
must be traversed. Therefore, having an edge-factor with relatively short cycles is 
desirable. For an arbitrary nonsingular feedback function, the length of the longest 
cycle can be expected to be very large [Ref. 6]. Jansen et.al. provides an approximate 
count on the number of linear functions that yield cycles whose maximum cycle length 
is 4n [Ref. 29]. 


109 


















VII 


CONCLUSIONS 


“ Results! Why, I have got a lot of results. / know several thousand 
things that won’t work. ” 


Thomas Edison 


A. SYNOPSIS OF THE THESIS 

The binary ie Bruijn cycle provides a special case of extracting distinct 
n-tuples from a 2 n -long binary sequence. Generalizing the de Bruijn property, i.e., 
extracting n bits that are not necessarily consecutive along the cycle, was the funda¬ 
mental idea motivating this work. There appears to be no previously published work 
analyzing the set of complete cycles, a set that includes the well-known de Bruijn 
cycles. 

In Chapter III, the framework is developed to systematically comb from a 
sequence every n-tuple appearing at successive positions along a subsequence of the 
sequence. We found that the run structure for all but one class of complete sequences 
is completely determined. The set of complete sequences with m < n consecutive 
teeth defines a cycle visiting every arc an equal number of times along an appropriate 
Good - de Bruijn digraph. Only the case for m = n — 1 is thoroughly discussed in 
Chapter III. Chapter III concludes with a probabilistic argument that shows that the 
size of the set of complete sequences for a specific (/,n)-comb is substantially smaller 
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than the size of the set of 2 n -long de Bruijn cycles. The class of de Bruijn sequences 
is the largest class of complete sequences. This surprising result is not intuitively 
obvious, given the randomness properties of the eligible sequences. 

Chapter IV provides a comprehensive survey on double Eulerian cycles along 
graphs and digraph. The necessary and sufficient conditions for a graph or digraph 
to have a double Eulerian cycle are given. We describe a class of double Eulerian 
cycles along a graph (digraph) that minimizes the difference of visitation times along 
each edge (arc). The necessary and sufficient conditions are found to generate a 
minimum double Eulerian cycle. A closed formula using the edge-status of a graph 
is shown to provide the value of a graph. An algorithm is presented that, given 
the appropriate edge-factor, generates the minimum double Eulerian cycle along a 
digraph. A closed formula is provided for the value of a Eulerian digraph. A conjecture 
for the appropriate edge-factor to generate the value of the Good - de Bruijn digraph 
is made. 

Chapter V provides the necessary and sufficient conditions for the permutation 
of the run sequences in a de Bruijn cycle. We see that the structure of distinct 
n-tuples is a randomness property of the runs in the sequence. This structure is a 
specific case of the property that each arc in a walk defined by a class of complete 
cycles is equally likely to be taken along an appropriate digraph B n . 

A new efficient algorithm for the generation of classical de Bruijn sequences is 
developed in Chapter VI. The algorithm uses only 3n bits of storage to generate a 
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de Bruijn cycle of length 2". Although this algorithm adjoins cycles in the usual way, 
the class of sequences generated is different from that found by previous algorithms. 


B. OPEN QUESTIONS 

The scope of potential research in the area of generalized de Bruijn cycles is 
vast. The very fundamental question of the existence of nonclassical de Bruijn cycles 
for each value of n remains to be answered. A general formula describing the size of 
the set of nonclassical sequences continues to be an elusive problem. 

The formulation of an algorithm to generate a class of complete cycles for each 
value of n would be a significant contribution in this area of research. 

There are many different (Z,n)-combs available to extract distinct n-tuples. 
Further analysis on specific combs would be a valuable extension of the work done in 
Chapter III. 

Proving the conjecture in Chapter IV on the value of the Good - de Bruijn 
digraphs would complete an important contribution to the analysis of double Eulerian 
cycles on directed graphs. More generally, finding the appropriate edge-factor of a 
digraph to generate the value of a graph would be a very nice result. 

The class of sequences generated by the algorithm in Chapter VI has not been 
fully examined. The relation, if any, between these sequences and sequences generated 
by previously algorithms would be an interesting result. Determining the designated 
vertex, representative vertices, and the distance function to generate the prefer ones 
sequences, for example, could provide insight into a new algorithm. 


113 





These are but a few of the promising research areas that arise from the chal¬ 
lenge of understanding complete sequences. A thorough understanding of nonclassical 
de Bruijn sequences is an endeavor to find structure in randomness, a statement that 
might be make of every research endeavor! 

This appears to be the first study of complete sequences, and it portends to 
be a rich area of continuing research. 
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APPENDIX A. SUMMARY OF SOME 
PREVIOUS WORK ON CIRCUITS IN 
DIRECTED GRAPHS 


“In all things, there is a law of cycles” 

Tacitus 

The following is a brief summary of the derivation of Theorem III.3. This 
theorem allows one to easily determine the number of ways to traverse each arc of 
an Eulerian digraph exactly u times. The development follows the work by van 
Aardenne-Ehrenfest and de Bruijn [Ref. 12]. 

Let S m be the permutation group (or symmetric group) on m elements. If S 
is a subset of S m then the number of elements of 5 of order m is denoted by ||S|| and 
the total number of elements in 5 by |5|. A subset V of 5 m is called a V-set (in S m ), 
whenever it has the property that ||sZ>|| has the same value for all s 6 S m . 

Let k and n be natural numbers, and take m = kn. Consider the set E m of 
m elements, divided into k systems, each containing n objects. The set H denotes 
the subgroup of S m consisting of all fc!(n!)* permutations with the property that if 
h € H then ha and hb belong to the same system whenever a and b belong to the 
same system. In other words, H transforms systems into systems. 

The following theorems from [Ref. 12] are stated with proof. 
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Theorem A.l (van Aardenne-Ehrenfest and de Bruijn) H is a V-set 


in S m . 

Let R denote the set of all permutations in S m with the property that the n 
objects of each system are transformed into objects of n different systems. In other 
words, if r € R where a and b belong to the same system, then ra and rb belong to 
different systems. It can also be shown that R is a V-set in S m . 

Let D = (V, A) be a regular Eulerian digraph where in(x) = out(x) = a, 
V r € V. Two cycles are considered identicad whenever the arcs of the first cycle are 
a cyclic permutation of the arcs of the second. 

The number of Eu’ ricm cycles in D is denoted by ||D||. A permutation P 
of the set of arcs of A = (a u ... ,a,) is called conservative (with respect to D ), if 
Pai — aj always implies that a, is adjacent to a,. Let E a be an arbitrary but fixed 
Eulerian cycle in D. The set of all conservative permutations of D can be represented 
as BE 0 , where B = B\ x ... x B\v\ is the group of all permutations where B, is 
the group of permutations that permute arcs having the same initial vertex Vi, but 
leave invariant all other arcs. Therefore, any Eulerian cycle determines a conservative 
permutation and any conservative permutation determines an Eulerian cycle. Hence, 
11,D|| = ||0£ o ||. Let Bi, ..., f?|y| be subsets of B\, ... ,5|v| respectively, where 

B = B x x ... x B\v\- 

The set of Eulerian cycles defined by B, denoted s||D||, are the cycles corresponding 
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to a permutation bE 0 where 6 € B and E 0 is a fixed conservative permutation. Using 
this notation, we have 


1|D||i= 8 HD||, bII^II = \\BE 0 \\i. 

If for each i, B, is a P-set in B,, then B is called normal. The following result 
provides a relationship between the set of all conservative permutations and a subset 
of consvervative permutations. 

Theorem A.2 (van Aardenne-Ehrenfest and de Bruijn) If B is normal, 
then 

bIIPII oIIPII 

\B\ - |B| ’ 

where |J9| and \B\ denote the number of elements of B and B, respectively. 

Recall that D = (V, A) is an Eulerian regular directed graph with \V\ vertices 
and |A| arcs. If in(v t ) = a for 1 < i < jV|, then |A| = <r\V\. 

Let A be a positive integer. Then by D^) we denote the graph that arises from 
D if we replace any arc (x,y) by A arcs (x,y). We see that D(\) has |V| vertices cind 
A|A| = Acr|V| arcs. The set of A arcs arising from each arc in D are said to form a 
bundle. 

We consider 3 categories of cycles that traverse every arc in D(>). Each of 
these categories is normal in the sense described above. 

1. Bi = Bu where Bi is the permutation group of order ((A<r)!). 
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2. Bi = Hi, where Hi is the subset of B, that associates bundles with bundles. 


That is, if a and b are in the same bundle, then ha and hb are in the same bundle 
where h 6 H t . The order of H, is a\(X\) a . 

3. Bi = Ri, where Ri is the subset of B, that associates the arcs of each bundle 
with a different bundles. That is, if a and b are in the same bundle, then ra and rb 
are in different bundles where r € Ri. The order of Ri is |/?|. 

We have by Theorem A.2 

((A<t)!)I v I (<r!)l v l(A!)l £ l |fl| * K 1 

Recall that s||D(a)|| = ||£(a)||i- The number h||Z)(a)|| is proportional to the 
number ||Z?||i. It can be seen that each Eulerian cycle of D arises from A“ 1 (A!) |V| 
different cycles in ||0(A)||- Hence, 

hIIAaiII = a-‘(A!)|>' | |U>||,. 

By Equation A.l we have, 

l|D(»)ll. = A-'IPH, . (A.2) 

A <T-cycle in D is a cycle containing each edge of D exactly a times. A a-cycle 
is called restricted if it happens that any pair of adjacent edges of D appears just once 
in the cycle. The number of different <7-cycles can be determined from Equation A.2. 
A difficulty arises from the fact that a <r-cycle may be periodic of period \E\d, where 
d is a divisor of a. Let c(p) denote the number of ^-cycles in D with the period \E\p. 
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For each p-cycle in D there arise (pl)^ Euierian cycles in D( p ). It follows immediately 


that 

HArtiii = E (A.3) 

A, P 

From the Mobius inversion formula, 

«0>) = E;»‘(2)(< i! )‘ Mi| ll £> Mlli. (A.4) 

d\p " 

and the number of unrestricted p-cyc\es equals 


E«(<0 = ;£*(§) (<fi)- |B| <fl|D M ||i, (A.5) 

d\p P d\fi va/ 

where 4> is Euler’s totient function. Evaluating ||£( p )||i using Equation A.2, the 
number of unrestricted p-cycles in a <7-regular directed graph D — (V, A) is given by 



where <f> is the Euler’s totient function and the summation is extended over all divisors 


of p. 
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APPENDIX B. A DIGRAPH ISOMORPHIC 

TO B n 

A friend is .. .a second self. 

Cicero 

In the Good - de Bruijn digraph B n each vertex is labeled with a unique 
binary n-tuple. There is an arc (x,y) from vertex x = (xo,xi,...,x n _i) to ver¬ 
tex y S (y 0 ,yi,...,y n -i) in B n if and only if (x 1 ,x 2 ,...,x„_i) = (yo,yi, ... ,y n - 2 )- 
Equivalently, we can label each vertex x = (xi, x 2 ,..., x n _i) with the unique integer 

n-1 

i that is the decimal representation of x defined by i = ^ x J 2 n_J ’“ 1 . There is an arc 

j =o 

from vertex i to vertex j if and only if j € {2t,2i + 1} mod 2". 

We define S n _( fc+1 ) for 0 < k + 1 < n, to be a 2-regular directed graph with 
2 «-(fc+i) ver t;i ces an( j 2 n-fc arcs. We label each vertex in B n ^k+i) with a unique 2 fc+1 
element set 

{2 k+1 t,2 k+1 z + l,...,2 k+l i + 2 ( * +I) - 1} where 0 < i < 2 n ~ (fc+1) - 1. 

There is an arc from the vertex {2 fc+1 t, 2 fc+1 i + 1 ,..., 2 k+1 i + 2^ k+1 ^ — 1} to the vertex 
{2 k+1 j, 2 k+1 j + 1 ,..., 2 k+ H + 2- 1} in 5 n _ (jfe+1) if and only if 

2v mod 2” € {(2 k+1 j), (2 k+l j + 1 ),... ,2 k+1 j + 2 ( * +1) - 1}, 

where v 6 {2 k+1 t,2 fc+l i + 1 ,... ,2 fc+1 z + 2 (fc+1) - 1}. 
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Let V(B n ) be the set of vertices in B n where the vertices are represented by 
the integers i where 0 < i < 2 n - 1. Let V{B n _ {k+l) ) denote the set of vertices in 
1 ). Furthermore, let A{B n ) and A(B n ) denote the set of arcs in B n and B n , 
respectively. 

Lemma B.l The function <i>: V(B m ) -4 V(B n _(*+j)) for m = n — (k 4-1), is 
defined by the rule 

v<t>= { 2 fc+ 1 u, 2 * +1 u + l,..., 2 * +1 u + 2 (fc+1) - 1 } mod 2 n . 

Then <j>: V(B m ) -4 V(B n -[k+ 1)) is one to one. 

Proof: Let i,j € V(B m ), with i ^ j. Then 

i<f> = { 2 fc+ 1 i, 2 fc+1 i + l,..., 2 fc+1 t + 2 (fc+ 1 ) -l}mod 2 n , and 
j<t> = {2 k+1 j,2 k+l j + 1 ,. ,.,2 k+1 j + 2 (fc+1) — 1 } mod 2 n . 

Assume i<f> = j<f>. Since 2 n > 2 m , it follows immediately that i = j. Therefore, 

</> is 1 - 1 . ■ 

Lemma B.2 For the rule <fi as defined in Lemma B.l, 4>: V{B n ) —► V(B n -( k + 1 )) 
is onto. 


Proof: Let {2 fc+ 1 i,2* +1 i + 1,...,2 fc+1 i + 2<* +1 > - 1 } mod2 n e V(B n ). Then 
{2* +1 t, 2 fc+1 t + 1 ,..., 2 fe+1 * -)- 2< fc+1 >- 1 } mod 2 n = *(<£). ■ 









Lemma B.3 The function 9: A{B n ) -4 A(£ n _(*+i)) for m = n - (k + l), is 
defined by the rule 

( i,j)0= ({ 2 fc+l »,2 fc+1 z + 1,.. .,2 k+1 i + 2 (fc+1 ^ — 1} mod 2 n , 

{ 2 k+l j, 2* +1 j + 1,..., 2 i+1 ; + 2 (fc+1) - 1} mod 2"), 

where (»',/) € A(B n ) if and only if (i(<t>),j(<t>)) € A(5 n _(*+i)). Then 9: 
A(B n ) -4 A(B n .(k+i)) is one to one. 

Proof: Let (*',/) € A(B n ). with (i,j) # (i',/). Then 

(*,/)$= ({ 2* +l i,2 fc+1 t + 1,..., 2 fc+l i + 2 (fe+1) - 1} mod 2”, 

{ 2 fc+l j,2 fc+1 j + 1,.. .,2* +1 j + 2 (fc+1) - 1} mod 2 n ) 

and 

{i',j')0= ({ 2 fc+1 z',2 fc+1 i / + l,...,2* +1 i , + 2 {fc+1 ) -l}, 

{ 2 fc+1 j',2!* +1 / + 1,...,2 * +1 / + 2 (fc+l) - 1} mod 2"). 

Assume (i, j)0 = (i',j')0. It follows immediately that (t,j) = (*', f). There¬ 
fore, 9 is 1-1. ■ 

Lemma B.4 For the rule 0 as defined in Lemma B.3, 9: A{B n ) -4 A(Z? n _(*+i)) 
is onto. 
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Proof: Let 


({ 2 k+1 i,2* +1 i + 1.2 fc+1 t + 2 |k+1) - 1} mod 2", 

{ 2‘*‘j,2‘ +1 ; + 1,..., 2 M j + 2< MI - 1} mod 2“) € V(5„., 4+11 . 

Then 

({ 2 fc+ ^',2 fc+1 i + l,...,2 fc+1 i + 2 ( * +1) - 1} mod2 n , 

{ 2 k+l j,2 k+l j + l,...,2* +1 j + 2<* +1 > - 1} mod 2") = (,J)(0). 


Theorem B.5 The digraphs B m and 2^ n -(*+i) for m = n - (k + 1), are iso¬ 
morphic. 

Proof: The function <f>: V(B n ) -4 V(B n -(k+i)) defined by the rule 

v4> = {2 k+l v,2 k+l v + 1,.. .,2 k+1 v -I- 2 (fe+I) - 1} mod 2 n 

and the function 0: A(B n ) -4 A(B n ) defined by the rule 

( i,j)0= ({ 2* +1 t,2 fc+1 i + l,...,2 fc+1 z + 2 (fc+l) -l}, 

{ 2 k+1 j,2 k+1 j + l,...,2 fc+1 j + 2 (fc+1) - 1} mod 2 n ) 

are bisections. It follows by definition that the digraphs B n and B n -(fc+i) are 
isomorphic. ■ 
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{04,2,3,4,5,6,7} 

It 

{8,9,10,11,1243,14,15} 

O 

# 4 —( 2 + 1 ) 


o 

{04,2,3} 


{ 4 , 5 , 6 , 7 } 


{8,9,10,11} 


{12,13,14,15} 

O 

■^ 4 —( 1 + 1 ) 



Figure 21. Good - de Bruijn digraphs 5 4 , £?4_(o+i)4^4-(i+i)> ond 5 4 _( 2 +i) 
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